This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 



BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 



BLACK BORDERS 

TEXT CUT OFF AT TOP, BOTTOM OR SIDES 
FADED TEXT 
ILLEGIBLE TEXT 
SKEWED/SLANTED IMAGES 
COLORED PHOTOS 

BLACK OR VERY BLACK AND WHITE DARK PHOTOS 
GRAY SCALE DOCUMENTS 



IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 




(43) International Publication Date 00) International Publication Number 

8 March 2001 (08.03.2001) PCT WO 01716764 Al 



(51) International Patent Classification 7 : G06F 13/00 

(21) International Application Number: PCT/USOG/23479 

(22) International Filing Date: 28 August 2000 (28.08.2000) 

(25) Filing Language: English 

(26) Publication Language: English 

(30) Priority Data: 

09/386,264 31 August 1999 (31.08.1999) US 

(71) Applicant: RT1MAGE INC. [US/US J; Suite 290, 1111 
Bayhill Drive, San Bruno, CA 94066 (US). 

(72) Inventors: DEKEL, Shai; Dekel Street 3/36 f Or- Yehuda 
(IL). OVS1ANKIN, Alexander; Apartment 44, 3 Dakar 
Street, Lod (IL). 

(74) Agents: KIRSCH, Gregory, J. et al.; Needle & Rosen- 
berg. P.C., The Candler Building, Suite 1200, 127 
Peachtree Street, N.E., Atlanta, GA 30303-1811 (US). 



(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CR, CU. CZ, 
DE, DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, HR, 
HU, ID, EL IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, 
LS, LT, LU, LV, MA, MD, MG, MK, MN. MW, MX, MZ, 
NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ. TM, 
TR, TT, TZ, UA, UG, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS. MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ. BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB. GR, IE, 
IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, 
CI, CM, GA, GN, GW. ML, MR, NE, SN, TD, TG). 



Published: 

— With international search report. 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations'* appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



= (54 ) Title: SYSTEM AND METHOD FOR TRANSMITTING A DIGITAL IMAGE OVER A COMMUNICATION NETWORK 









KB WCMSR A. 


H*ONC QJEW 


IMMG 

aoc on 


^ \ 

OJQR COMD 

11D y 




o 
O 



(57) Abstract: The imaging system that is described below is an image streaming system which is different from traditional com- 
pression systems. It eliminates the necessity to store a compressed version of the original image, by streaming ROI data using the 
original stored image. It avoids the computationally intensive task of compression of the full image. Instead, once a user (1 10) 
wishes to interact with a remote image, the imaging server (120) performs a fast preprocessing step in near real time after which it 
can respond to any ROI requests also in near real time. When an ROI request arrives at the server (120), a sophisticated progressive 
image encoding algorithm is performed, but not for the full image. Instead, the encoding algorithm is performed only for the ROI. 
Since the size of the ROI is bounded by the size and resolution of the viewing device at the client (1 10) and not by the size of the 
image, only a small portion of the full progressive coding computation is performed for a local area of the original image. The client 
computer (110) performs decoding and rendering only for ROI and not the full image. The present system supports several modes 
of progressive transmission: by resolution, by accuracy and by spatial order. 
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SYSTEM AND METHOD FOR TRANSMITTING A DIGITAL IMAGE OVER 
A COMMUNICATION NETWORK 



Background of the Invention 

5 

1. Field of the Invention 

This invention relates to systems and methods for transmission of still images 
over relatively low-speed communication channels. More specifically the invention 
1 0 relates to progressive image streaming over low speed communication lines, and may 
be applied to a variety of fields and disciplines, including commercial printing, medical 
imaging, among others. 



15 



2. Brief Description of the Prior Art 



In a narrow bandwidth environment, a simple transfer to a client computer of 
any original image stored in a server's storage is obviously time consuming. In many 
cases the user only wishes to view a low resolution of the image and perhaps a few 
more high resolution details, and so a full image transfer is inefficient. This problem 

20 can be overcome by storing images in some compressed formats. Examples for such 
formats are standards such as Progressive JPEG, FlashPix, the upcoming JPEG2000, 
^nd some recent proprietary so-called wavelet formats. Some of these formats allow 
progressive transmission (of the full image) and some allow transmission of only region 
of interest (ROI) data, thereby avoiding the necessity to transmit the entire image 

25 before it can be rendered at the client side. 

But there are at least three main problems with the existing solutions: first, one 
needs to prepare and Store a compressed image of this type for any original image 
located in the storage. In many cases this results in storing both the original image and 
30 the compressed version. Sometimes the size of the file prepared for transmission is 
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even larger than the original file. For example, a lossless FlashPix representation of a 
given image is typically 4/3 the size of the original image. Secondly, the computation 
of any of the above formats is much more intensive than regular compression 
techniques such as the base-line version of JPEG. The last problem is that the existing 
5 methods generally do not support efficient extraction and transmission of ROI data, but 
rather the whole image in progressive mode. The FlashPix format that does support 
RQI streaming is not a wavelet-based format and therefore cannot take advantage (as 
wavelets do) of the relations between the various resolutions of the image. 

10 The above-mentioned problems with the prior art result in inefficient systems 

both in architecture and data handling workflows. This tends to be the case for imaging 
applications such as in the graphics arts and medical environments, where image files 
are relatively large and they are created dynamically as external events dictate. 

j 5 Summary of the Invention 

The imaging system that is described below is an image streaming system 
which is different from traditional compression systems and overcomes the above 
problems. It eliminates the necessity to store a compressed version of the original 

20 image, by streaming ROI data using the original stored image. The imaging system of 
the present invention also avoids the computationally intensive task of compression of 
the full image. Instead, once a user wishes to interact with a remote image, the imaging 
server performs a fast preprocessing step in near real time after which it can respond to 
any ROI requests also in near real time. When a ROI request arrives at the server, a 

25 sophisticated progressive image encoding algorithm is performed, but not for the full 
image. Instead, the encoding algorithm is performed only for the ROI. Since the size of 
the ROI is bounded by the size and resolution of the viewing device at the client and 
not by the size of the image, only a small portion of the full progressive coding 
computation is performed for a local area of the original image. This local property is 

30 also true for the client. The client computer performs decoding and rendering only for 
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ROI and not the full image. This real time streaming or Pixels-On-Demand™ 
architecture requires different approaches even to old ideas. For example, similarly to 
some prior art, the present imaging system is based on wavelets. But while in other 
systems wavelet bases are selected according to their coding abilities, the choice of 
5 wavelet bases in the present imaging system depends more on their ability to perform 
well in the real time framework. The system of the present invention supports several 
modes of progressive transmission: by resolution, by accuracy and by spatial order. 

Brief Description of the Drawings 

10 

Figure 1 is an overall system architecture block diagram for the present 
invention, in one embodiment. 

Figure 2 is an overall system workflow diagram for the present invention, in one 
15 embodiment. 

Figure 3 depicts, for a subband tile, spatial grouping of subband coefficients at a 
given resolution. 

20 Figure 4 is sample pseudo-code describing a bit plane scan process, forming a 

part of an encoding process. 

Figure 5 depicts the visualization of a data block having x, y, resolution and bit 
plane dimensions. 

25 

Figure 6 is a workflow diagram describing how a "progressive by accuracy" 
request list for a region of interest may be composed. 

Figure 7 is a workflow diagram describing the "progressive by accuracy" 
30 process for a client. 
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Figure 8 is a diagram describing the server workflow. 

Figure 9 depicts preprocessing steps that may be performed by the server. 

5 

Figure 10 is a diagram depicting the multiresolution structure used in the 
preprocessing step. 

Figure 1 1 is a flow diagram describing low resolution encoding. 

10 

Figure 12 is a flow diagram describing region of interest high resolution 
processing. 

Figure 13 is a flow diagram describing the outer loop of the server encoding 
15 algorithm. 

Figure 14 is a flow diagram describing a bit plane significance scan of the 
server encoding algorithm. 

20 Figure 1 5 is sample pseudo-code describing the bit plane scan of the client 

decoding algorithm. 

Figure 1 6 is a flow diagram describing the outer loop of the client decoding 
algorithm. 

25 

Figure 17 is a flow diagram describing the subband tile extraction of the client 
progressive rendering process. 

Figure 18 is a diagram depicting the visualization of the multiresolution 
30 memory strip data structure during the preprocessing algorithm. 
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Figure 19 is sample pseudo-code describing memory constraint subband tile 
preprocessing as pan of the preprocessing algorithm. 

5 Figure 20 is a diagram describing the tiles at the local separable forward 

subband transform. 

Figure 21 is a diagram depicting the local inverse subband transform. 

10 Figure 22 is a flow diagram describing the progressive rendering steps. 

Figure 23 is sample pseudo-code describing the recursive multiresolution march 
in the client progressive rendering process. 

15 Detailed Description of the Invention 

1. Notation and Terminology 

The following notation of Table 1 is used throughout this document. 

20 Table 1 



Notation 


Definition 


4D 


Four dimensional 


DCT 


Discrete Cosine Transform 


FIR 


Finite Impulse Response 


FWT 


Forward Wavelet Transform 


GUI 


Graphical User Interface 


ID 


Identification tag 


IWT 


Inverse Wavelet Transform 


ROI 


Region Of Interest 


URL 


Uniform Resource Locator 
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The following terminology and definitions of Table 2 apply throughout this 
document. 



Table 2 

5 



Term 


Flpfin itinn 

JL/Cl 111 1 11LFU 


Rendering 


Procedure to output/display a region of interest (ROI) of an 
l'minp \r\ q Hpvirp ciirh as a monitor onnter. etc. 


Multiresolution 


a r\^n-*Tr\\A'*\ ctnirTurf* rPTirp 4 ! enti n 0 an imace at dvadic 
resolutions, beginning with the original image as the highest 
resolution. 


Subband Transform / 
subband coefficients 


a mpthnH nf Hpmmnn<;inp an ima&e (time domain) to 
frequency components (frequency domain). A representation 
nf an imape as a sum of differences between the dyadic 
resolutions of the image's multiresolution. 


Wavelet l ransiorm / 
Wavelet coefficients 


A ^nerial case of Subband Transform. 


Quantization 


A process that, given a number, removes some precision from 
it, such that it will be possible to code it in lower precision 
using less bits of memory 


Threshold 


A process that, using a given positive constant, sets to zero any 
number whose absolute value is below the constant. 


Progressive Transmission 


Transmitting a given image in successive steps, where each 
step adds more detail to the rendering of the image 


Progressive Rendering 


A sequence of rendering operations, each adding more detail. 


Progressive by accuracy 


Transmit/render strong features first (sharp edges), less 
significant features (texture) last 


Progressive by resolution 


Transmit/render low resolution first, high resolution last 


Progressive by spatial 
order 


Transmit/render top of image first, bottom of image last 


Distributed database 


A database architecture that can be used in a network 
environment 


Subband/Wavelet tile 


A group of subband/wavelet coefficients corresponding to a 
time-frequency localization at some given spatial location and 
resolution/frequency 


Subband/Wavelet data 
block 


An encoded portion of a subband/wavelet tile that corresponds 
to some accuracy layer 
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2. Overview of the Invention 

Referring to Figure 1 , a block diagram is provided depicting the various 
5 components of the imaging- system in one embodiment. A client computer 1 10 is 
coupled to a server computer 120 through a communication network 130. 

In one embodiment, the client computer 1 10 and server computer 120 may 
comprise a PC-type computer operating with a Pentium-class microprocessor, or 
10 equivalent. Each of the computers 1 10 and 120 may include a cache 111,121 

respectively as part of its memory. The server may include a suitable storage device 
122, such as a high-capacity disk, CD-ROM, DVD, or the like. 

The client computer 110 and server computer 120 may be connected to each 
15 other, and to other computers, through a communication network 130, which may be 
the Internet, an Intranet (e.g., a local area network), a wide-area network, a wireless 
network, or the like. Those having ordinary skill in the art will recognize that any of a 
variety of communication networks may be used to implement the present invention. 

20 With reference to Figure 2, the overall system workflow is described. In one 

embodiment, using any browser type application, the user of the client computer 110 
connects to a Web Server 140 or directly to the Imaging server 120 as described in 
Figure 1. In step 201 ? the user then selects, using common browser tools, an image 
residing on the image file storage device 122. The corresponding URL request is 

25 received and processed by the imaging server 120. In the case where the results of 
previous computations on the image are not present in the imaging cache 121, the 
server 120 performs a fast preprocessing algorithm (described in further detail later in 
section 6.1). The result of this computation is inserted into the cache 121. 
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Unlike other more traditional applications or methods which perform full 
progressive encoding of the image in an offline type manner, the goal of the 
preprocessing step 202 is to allow the server 120, after a relatively fast computational 
step, to serve any ROI specified by the user of the client computer 1 10. For example, 
5 for a 75M RGB (color) image, using the server 120 of the present invention, installed 
on a computer with a Pentium processor, fast disk 122, running Windows NT, the 
preprocessing step 202 will typically take 5 seconds or less. This is by order of 
" magnitude faster than a full compression algorithm such as is described in the prior art, 
such as in J. M. Shapiro, "An embedded hierarchical image coder using zero-trees of 
10 wavelet coefficients", IEEE Trans. Sig. Proc, Vol. 41, No. 12, pp. 3445-3462, 1993; A. 
Said and W. Pearlman, "A new, fast and efficient image codec based on set 
partitioning", IEEE Trans. Circuits and Systems for video Tech., Vol. 6, No. 3, pp. 243- 
250, 1996; orD. Taubman, "High performance scalable image compression with 
EBCOT", preprint, 1999. 

15 

Serving a ROI is described herein as Pixels-On-Demand™, which identifies a 
progressive transmission of any ROI of the image essentially in real-time, where the 
quality of the view improves with the transfer rate. Once the preprocessing stage is 
done in step 202, the server 120 sends to the client 110 a notification that the image is 
20 ready to be served. The server 120 also transmits the basic parameters associated with 
the image such as dimensions, color space, etc. Upon receiving this notification, the 
client 1 10 can select any ROI of the image using a standard graphical user interface. 

The ROI is formulated in step 203 by the client 1 10 into a request list that is 
25 sent to the server 120, Each such request corresponds to a data block (described later in 
further detail in section 3). The order of requests in the list corresponds to a progressive 
mode selected in the context of the application, such as progressive by accuracy 
rendering of the ROI. Upon receiving the ROI request list, the server 120 processes the 
requests according to their order. For each such request the server 120 checks if the 
30 corresponding data block already exists in the cache 12L If not, the seiner 120 then 
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computes the data block, stores it in the cache 121 and immediately sends it to the 
client 110. Once a data block that was requested arrives at the client 1 10, it is inserted 
into the cache 111. At various points in time during the transfer process, a decision rule 
invokes a rendering of the ROI by the client 110. 

5 

If some of the data blocks required for a high quality rendering of the ROI were 
requested from the server 120, but have not arrived yet at the client 1 10, the rendering 
of the ROI will be of lower quality. But, due to the progressive request order, the 
rendering quality will improve with each received data block in an optimal way. In 
10 case the user changes the ROI, the rendering task at the client 1 10 is canceled and a 

new request list corresponding to a new ROI, sent from the client 1 10 to the server 120, 
will notify the server 120 to terminate the previous computation and transmission task 
and begin the new one. 

15 3. Imaging protocol and distributed data-base 

The imaging protocol serves as a common language between the server 120 and 
its clients 1 1 0, and is mainly used for the following purposes: 

20 1. By the client 1 10 to efficiently stream to the server an ordered request 

list of data blocks required for progressive rendering of a given ROI. 

2. By the server 120 to efficiently compose and send to the client 110 a 
stream of (requested) data blocks. 

25 

Both client 110 and server 120 use the same distributed database associated 
with the protocol, such that they are using the same tagging system for the various data 
blocks. Using this approach it is also possible to cache the data blocks at both ends. 
The server 120 is able to cache (in cache 121) any computed data block, such that this 
30 data block can be sent to any other client 110 that requests it. The client 1 10 is able to 
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cache (in cache 1 1 1) any data block received, such that for any subsequent ROI 
rendering operation requiring the data block, it will be extracted from the client cache 
1 1 1 and not ordered from the server 120. The protocol is designed such that the 
imaging system will have efficient ROI control and rendering in any mode such as: 
5 progressive by accuracy, progressive by resolution and progressive by spatial order. 

-The basic components of the protocol are accuracy layers associated with tiles 
of subband/wavelet coefficients. To illustrate, suppose l(x,y) is a given single 
component image such as a grayscale image. It will later be shown how the protocol is 
10 generalized to arbitrary color spaces. Let FWT(l) (Forward Wavelet Transform) be a 
subband/wavelet transform of the image. For references describing such transforms, 
see the basic introductions contained in C. K. Chui, "An introduction to wavelets", 
Academic Press, 1992; I. Daubechies, "Ten lectures on wavelets", Siam, 1992; and S. 
Mallat, "A wavelet tour of signal processing", Academic Press, 1998. 

15 

As described herein, we require that the transform will be separable. A 2D 
separable transform can be computed by two ID steps: in the X direction and then in 
the Y direction of the given image. As can be seen in Figure 3, the transform typically 
creates for each given resolution/frequency j three subbands denoted as hl p lh } ,hh r 

20 Thus, we can associate the coefficients of FWT(l) to the structure 301 . Observe that 
each coefficient belongs to a certain resolution 302. Also, if a Finite Impulse Response 
(FIR) subband filter (compactly supported wavelet) is used, each coefficient influences 
only a compactly supported region of the image. High resolution/frequency 
coefficients are associated with a small region and low resolution/frequency 

25 coefficients are associated with a larger region. 

For the purpose of efficient rendering the coefficients may be subdivided into 
the tiles 303. Into each tile we group three subgroups of coefficients corresponding to 
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subbands hljjh p hhj2\ the resolution j , associated with the same spatial area. 
Typically the sub-tiles are chosen to have a length of 32, so that the tile contains 
3x 32 : coefficients. This relates to the fact that at the time of rendering at the client 
1 10, the subband tile will provide a means to render a tile of length 64. A parameter 
5 tileLength controls the length of the subband tiles 303 and can be changed dynamically 
per image. To each such tile 303 we associate a 3D coordinate 

(t _x,t __y y t ^resolution) (1.1) 

10 Note that: 

1 . The Discrete Cosine Transform (DCT) transform used in the standard 
JPEG [PM1] is certainly a FIR separable subband transform, but of a 
slightly different nature than wavelets. Using the common 8x8 DCT as 

15 in JPEG limits the number of possible resolutions to four. Also the DCT 

coefficients are not traditionally associated with resolutions or with any 
of the three subband types hljh.hh . Nevertheless it is possible to do so 
(see Z. Xiong, O. Guleryuz and T. Orchard, "A DCT-based embedded 
image coder", IEEE Signal Proc. Letters, Nov. 1996) and the present 

20 imaging protocol supports the DCT by grouping the coefficients into 

such a tiled structure. 

2. To generalize the above for more than one image component, simply 
associate with the tile the coordinates described above as (1.1) all the 
25 coefficients corresponding to the same spatial location of all the 

components. Thus, 3 x 32 2 * numberOfComponents coefficients are 
associated with a multidimensional tile. 
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How the subband tiles are further decomposed to subband data blocks will now 
be further explained. The motivation for this finer 4D decomposition is this: Each 
subband tile at the resolution j participates in the rendering of local regions in all the 
resolutions > j . Assume j corresponds to a very low resolution. It turns out that for a 
5 high quality rendering of a given ROI at the resolution j , only the coefficients with 
high absolute value are required. Transmitting small coefficients will not have any 
significant visual effect at the time of rendering, and thus should be avoided. Also, 
transmitted coefficients need not be sent at a high precision since this also will have 
visual significance to the rendering operation. Rather, for each such significant 

10 coefficient only a few bits of precision should be transmitted along with the 

coefficient's sign. Rendering the same ROI at a higher quality or rendering a higher 
resolution will require sending of more of the tile's coefficients and also more accuracy 
bits for the coefficients that were already sent. For this reason, each subband tile is 
further decomposed into accuracy data blocks. Each data block is associated with the 

1 5 "bit plane" interval [0, 2) at level 0, or the intervals [2' , 2'*' ) at level / > 1 . 

As illustrated in Figure 5, each data block of the subband tile (1.1) will have a 
4D coordinate 

20 (t _x,t _y,t _ resolution^ _bitPlane) (1.2) 

where 0 < t _ bitPlane < maxBitPlane(t _ resolution) . x, y, resolution and bit plane are 
depicted in Figure 5 as reference numerals 501, 502, 503 and 504, respectively. The 
value of maxBHPlane(t Resolution) corresponds to some predetermined constant oris 
25 selected dynamically according to the largest coefficient at the resolution t _ resolution 
of the given image. For example, maxBitPlane{t Resolution) = 8 seems to work well 
on most images. Adapting the well known bit plane technique described in J. M. 
Shapiro, "An embedded hierarchical image coder using zero-trees of wavelet 
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coefficients" IEEE Trans. Sig. Proc, Vol. 41, No. 12, pp. 3445-3462, 1993; and A. 
Said and W. Pearlman, "A new, fast and efficient image codec based on set 
partitioning", IEEE Trans. Circuits and Systems for video Tech., Vol. 6, No. 3, pp. 243- 
250, 1996, a data block having the coordinates described above at (1.2) contains the 
5 following data in encoded format: 

1 . For t _ bitPlane > 0 , a "list" of the indices of all subband coefficients 
whose absolute value is in the range [2'- biipian ',2 t - birPlane * t ) . 

10 2. The sign of all the coefficients of 1. 

3. In some cases an additional precision bit is added for any coefficient that 
belongs to the current bit plane or any higher bit plane. 

15 It may be noted that: 



1 . In case the subband tile has coefficients whose absolute value 

- - " , where maxBitPlane{t _ resolution) is not * 

dynamically selected, their bit plane information, which is the most 
20 significant, is inserted into the "highest" possible data block 

(' _ x,t _>'» t _ resolution, maxBitPlane(t _ resolution)) 

In such a case, this most significant data block contains the data of 
25 possibly several bit planes. 



In lossy image coding, it is preferable to threshold (discard) all 
coefficients whose absolute value is below some given threshold e > 0 . 
In such a case each data block having the coordinates identified above at 
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(1.2) is associated with the bit plane [rf'^'^.rf'-^*^ 1 ). Observe 
that coefficients whose absolute value is smaller than € are not reported 
in any data block and therefore never transmitted. 

3. There are cases where different thresholds were chosen for different 
types of coefficients. This might be done internally by a preprocess stage 
before the encoding algorithm (see further description later in section 
6.1.1), or externally in the case where the original image is a JPEG 
image already containing encoded quantized subband coefficients. In 
such a case, £ is chosen as the minimal threshold. Yet all the threshold 
values are used in the coding algorithm to predetermine the significance 
ofa coefficient at a certain bit plane [^2 l >■ <te ,rf l - M,w, ) and 
improve the coding efficiency. Furthermore, if a coefficient is known to 
be quantized, it can be handled more efficiently by the coding algorithm, 
and transmission of fine precision data can be avoided. This can be 
achieved by equipping the coding algorithm with the knowledge of the 
given quantization scheme. These exceptions are explained in further 
detail later in section 4.1.3.1. 

4. A simple generalization of the above for more than one component can 
be performed by associating the coefficients of all components ofa tile 
in the same absolute value range with the same data block. However, 
this might not be efficient in the case where one component is visually 
more significant than others. This is when the original image pixels are 
converted from their original color space into a coding color space such 
as YC S C R , YUV or YIQ (see A. Drukarev, "Compression related 
properties of color spaces", Proc. SPIE Vol. 3024, 1997; and N. 
Moroney and M. Fairchild, "Color space selection for JPEG image 
compression", J. Elec. Imaging, Vol. 4(4), pp. 373-381, 1995). In these 
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color spaces, the Y component represents the luminance of the color 
image and thus is more visually significant than the other two 
components which represent the color information. Furthermore, in 
lossy coding one typically applies smaller threshold to the luminance 
5 component. In such a case, assuming the coefficients of each 

component c are thresholded with s c > 0 , we insert to the data block 
(1.2) with t _bitPlane > 0 , for each component c, all of the coefficient 
data associated with the bit plane interval [^e e 2 t ' N ^ 9 s e 2 t ' M>lam ^ 1 ). 

This allows the server 120 to transmit for each ROI more data for 
10 visually significant components and less data for other components. 

4. The progressive subband coding algorithm 



Before the details of the actual coding algorithm used in the imaging system of 
15 the present invention are described, it is important to understand that it is possible to 
use other variants of the algorithm describe herein, with no modification of the present 
imaging system architecture. Furthermore, although a "non-Zero Tree type algorithm 
is described, with some modifications, the imaging system can work with a "Zero Tree" 
type coder. Zero tree algorithms are described in further detail in publications, such as 
20 J.'M. Shapiro,"An embedded hierarchical image coder using zero-trees of wavelet 
coefficients", IEEE Trans. Sig. Proc, Vol. 41, No. 12, pp. 3445-3462, 1993; A. Said 
and W. Pearlman, "A new, fast and efficient image codec based on set partitioning", 
IEEE Trans. Circuits and Systems for video Tech., Vol. 6, No. 3, pp. 243-250, 1996; 
and A. Said and W. Pearlman, u An image multiresolution representation for lossless 
25 and lossy compression", IEEE Trans. Image Proc, Vol 5., No. 9, pp. 1303-1310. 

The imaging system of the present invention does not, in one embodiment, use a 
"Zero Tree" method mainly because it is not very well suited for scalability. That is, 
"Zero Tree" methods are not efficient for low resolution regions of interest. In such 
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cases, "Zero Tree" algorithms actually transmit data associated not only with the ROI, 
but also with higher resolutions. The overhead is typically 5%-10% more data than the 
non-"Zero Tree" method described below. Since for large images the user rarely views 
the highest resolution, this overhead is significant. Furthermore, working in a "Zero 
5 Tree" framework requires the computation of relations between coefficients at different 
resolutions. Although this can be done rather efficiently with some modification of the 
imaging system, it is still an overhead that should be avoided. Nevertheless, the present 
invention could be adapted to use the "Zero trees" approach. 

I o A coding algorithm is generally composed of two modules: an encoder and a 

decoder. The encoder receives as input a group of, possibly thresholded or quantized, 
coefficients of a subband tile (1.1), encodes the data blocks (1.2) and stores them in a 
pre-designated location such as a slot in a database. The decoder, upon receiving data 
blocks, decodes them and reconstructs approximations of the original subband 

1 5 coefficients. The features of the coding algorithm of the present invention, in one 
embodiment, are: 

1 . It is relatively simpler and faster than other state of the art methods, such 
as that described in A. Said and W. Pearlman, "A new, fast and efficient 
20 image codec based on set partitioning", IEEE Trans. Circuits and 

Systems for video Tech., Vol. 6, No. 3, pp. 243-250, 1996; and D. 
Taubman, "High performance scalable image compression with 
EBCOT", preprint, 1999, but with comparable results. 

25 2. Thedatablock (( _x,t _y,t _ resolution,b 0 ) can be decoded only if the 

data blocks (t _x,t _y,t _resolution,b) with b>b 0 are present at the 
client side 110. However, this is not a real limitation, since in any 
rendering mode and for any given ROI, it would be desirable to have the 
data blocks b > b 0 before b 0 . Other than that, the data block does not 



BNSDOCIO: <WO 011€764A1J_> 



WO 01/16764 



PCTAJS00/23479 



17 



depend on any other blocks, which is important for efficient ROI 
transmission. 

3. The coding algorithm can be used for any subband or wavelet transform. 
5 In fact, with a few minor additional steps it can be used to efficiently 

stream regions of interest from JPEG files. Quantized DCT coefficients 
read from a JPEG file are coded by the algorithm such that once the true 
quantized value is understood by both encoder and decoder, no more 
precision bits are required for it. The quantized coefficient is 
10 reconstructed exactly as if the original full JPEG image was sent to the 

client 110. 

4. As with most algorithms of this type, the algorithm uses an arithmetic 
coder. The algorithm can use any reasonable arithmetic coding module, 

15 but it is especially designed to work with a binary arithmetic coder. 

Since a binary coder only codes two symbols, a much faster 
implementation is possible compared to a general arithmetic coder, such 
as described in A. Moffat, R. Neal, I Witten, "Arithmetic coding 
revisited", Proc. DDC (Snowbird, Utah), pp. 202-21 1, 1995. 



20 



4.1. The Encoding Algorithm 



The encoding algorithm of the present invention is performed at the server 120. 
As explained, the present imaging system is unique in that this rather time consuming 
25 task is only done locally in near real-time for a ROI, and not the full image. The 
encoding algorithm is described for images with a single color component, such as 
grayscale images, but of course may also be applied to images with multiple color 
components. The straightforward generalization for an arbitrary number of 
components will be explained later. 

30 
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The algorithm receives as input the following parameters defined in Table 3: 



Table 3 



Variable 


Meaning 


coef 


Matrix of subband coefficients. Typically 
3 x32 2 coefficients 


equalBinSize 


True for high bit rate (high quality). False for low bit rate 
(low quality) 


£< 


The minimal threshold for the component 



5 



The coding strategy is similar in some sense to that described in A. Said and W. 
Pearlman, "A new, fast and efficient image codec based on set partitioning", IEEE 
Trans. Circuits and Systems for video Tech., Vol. 6, No. 3, pp. 243-250, 1996, but with 
10 no "Zero Tree" data. At any given time in the encoding algorithm, a coefficient 
belongs to one of three groups: 

1. Typel6: A group of 4x4 coefficients {coef^i + xAj + y)}]^ 

2. Type* : A Group of 2x 2 coefficients {coef(2i + x 9 2j + y))\^ 
15 3. Typel: Single coefficient 

4.1.1. Encoding algorithm initialization 

In order to initialize the encoding algorithm, the following procedure is 
20 performed: 

1. Assign to each coefficient coef (.r,y) its bit plane b(x t y) such that: 

\coef(.x,y)\e[e ( 2\e c 2") 
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2. Compute the maximum bit plane over all such coefficients: 
maxBiiPlane[tile) - max {b{x,y)) 

5 3. Write the value of maxBitPlane(tile) using one byte as the header of 

the data block: 

(/ _ x, t _ y, t _ resolution, maxBitPlane{t _ resolution)) 

4. Initialize all the coefficients as members of their corresponding Typel6 
10 group. 

5. Initialize a list of significant coefficients to be empty. 



6. Initialize a coefficient approximation matrix coef as zero. 



15 



4,1.2* The outer loop 

20 The outer loop of the encoding algorithm is described in Figure 13. In step 

1301 and 1302, the encoding algorithm scans the bit planes from 

b - maxBitPlane(tile) to 6 = 0. The output of each such bit plane scan is the subband 
data block (1.2) described previously in section 3. Since the last stage of the encoding 
algorithm is arithmetic encoding of given symbols, at the beginning of each scan the 
25 arithmetic encoding output module is redirected to the storage area allocated for the 

particular data block. Once the bit plane scan is finished (step 1302) and the data block 
(1.2) has been encoded, the output stream is closed and the bit plane b is decremented 
(step 1303). 
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10 



4.1.3. Bit-plane scan 

The framework of the bit plane scan is described in Figure 14, while the 
pseudo-code is provided in Figure 4. The scan, for a given level b , encodes all of the 
coefficients' data corresponding to the absolute value interval [e2 b ,e2 M ) (see section 
3, described previously). 

4.1 .3.1. Significance Scan 



As with any progressive subband coding algorithm, the goal is to efficiently 
encode the locations of the coefficients that are exposed while traversing down to the 
current bit plane. Naturally, the encoder and decoder have at the time of the scan all 
the information on what took place at the previous higher bit plane scans. As with most 

1 5 methods, the heuristics of the algorithm is that coefficients which are insignificant at 
the current bit plane, which are coefficients with absolute value < el" are spatially 
correlated. If it is known that a coefficient is insignificant at the bit plane b then with 
"higher" probability so will his neighbors. This explains why the algorithm groups 
coefficients into groups of 16 and then 4 and tries to efficiently report (with one 

20 symbol) their insignificance. The significance scan uses four binary probability 
models: 

1 . Type] 6 model: used to model significance of the groups of 1 6. 

2. TypeA model: used to model significance of the groups of 4. 
25 3. Typel model: used to model significance of single coefficients. 

4. Sign model: used to model the sign of a significant coefficient. 

These models are simply histograms with two levels. They are initialized to 
equal probabilities at the beginning of each significance scan, and are used by and 
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updated by the arithmetic encoder each time a symbol is encoded. The significance 
scan is now described in further detail. 

Groups of 16 , As described in Figure 14, the first loop is on groups of 4x4 
. 5 coefficients. In step 1401, if there are no more such groups the significance scan is 

terminated and we proceed to the accuracy update step 1402 of section 4.1.3.2. In step 

1403, if the current group is still labeled Type\6 , then in step 1404, we compute 

b _ group - max b (x, y) and arithmetically encode the boolean value b _ group < b . 

.x,y€grouplb 

In step 1405, if b _ group < b , then the group is currently insignificant and we proceed 
10 to the next group (step 1401). Else, in step 1406, we remove the label Typel6 y split the 
group to 4 subgroups of 2 x 2 coefficients and label each of them Type4 . If the group 
of 16 was not labeled TypeX 6 then it must have been split at a higher bit plane. In both 
cases, we must now continue to handle the four subgroups of four coefficients. 

15 Groups of 4 . Beginning in step 1410, we loop over the subgroups of four 

coefficients. In step 141 1, if a subgroup is labeled Type4 , 

b ^subgroup = max b(x, v) is again computed, and the value of the test 

j, y€ Subgroup 

b _subgroup <b is arithmetically encoded. The arithmetic encoding step 1412 is not 
performed if the following three conditions hold: 

20 

1 . We are now at the fourth subgroup. 

2. The previous three subgroups are still labeled TypeA . 

3. The four subgroups were created just now at the current scan by splitting 
a group of 16, 

25 

Under these conditions, it is certain that b _ subgroup > b , else the split would 
not have taken place. Thus, there is no need to arithmetically encode the significance 
of the subgroup, since this information will be known also to the decoder at this stage. 
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In step 1413, if the current subgroup is found to be (still) insignificant, we 
proceed to the next subgroup (step 1410). Else, in step 1414, the label Type* is 
removed, the subgroup is split to the single coefficients and each of them is labeled 
5 TypeY. If the subgroup was not labeled Type4 , then it must have been split at a higher 
bit plane. In both cases we must now continue to handle each individual coefficient. 

Sin gle coefficient . In step 1415, we loop over the four coefficients. If a 
coefficient coef (x, y) is labeled Typel , the value of the test b(x,y) < b is 
10 arithmetically encoded. The arithmetic encoding step is not performed if the following 
three conditions hold: 

1 . We are now at the fourth coefficient. 

25 2. The previous three coefficients are still labeled Typel . 

3. The subgroup of the four coefficients was split just now at the current 
scan. 

20 Under these conditions it is certain that b(x, y) > b , else the split would not 

have taken place. As in the subgroup case, there is no need to arithmetically encode the 
significance of the coefficient. 

If the current coefficient is found to be (still) insignificant, we proceed to the 
25 next coefficient. Else, the sign of the coefficient is arithmetically encoded, the label 
Type] removed, and the coefficient is added to the list of significant coefficients. We 
now set 
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2 3x2* 
coef ( jc, v) = sign (coef(x, y)) xe c x — — 



The value of coef (x,y) simulates the reconstruction of the coefficient by the 
decoder. At this stage, with no additional information available, this will be the 
5 approximated value of the coefficient by the decoder, which corresponds to the middle 

of the bit plane interval [^2*,^ 2* +1 ) . 

4.1.3.2. Accuracy Update 

10 This step is performed if one of the following two conditions hold: 

1 . equalBinSize = false (see Table 3) 



15 



2. b>0 



The parameter equalBinSize is set to be true in high quality (high bit rate) lossy 
coding. In such a case, the value of s c will be already relatively small and the step 
described below at the bit plane b = 0 will increase significantly the data transfer for 
very little visual gain (see S. Mallat and F. Falzon, "Understanding image transform 
20 codes", Proc. SPIE Aerospace Conf, 1997). 

At the bit plane b , the significant coefficient list contains all the coefficients 
coef(x,y) for which b(x,y)>b and their current approximated value coef(x 9 y). 

For each coefficient in the list, the value of the test coef (x,y) < coef {x, y) is 

25 arithmetically encoded. To that end, a special binary probability model is initialized at 
the beginning of the step and used by the arithmetic encoder. After the accuracy 
improvement has been encoded, we then accordingly simulate the reconstruction done 
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10 



20 



by the decoder and update the coefficient's approximated value by adding/subtracting 
from coef(x,y) the amount 



2* 



Note that coefficients which belong to a group of coefficients known to be 
thresholded with thresh Z s c can be more efficiently encoded. For example, assume a 
coefficient is known (by both encoder and decoder) to be thresholded by thresh >2 l e e , 
k>l. Also assume that after scanning the bit plane associated with the interval 
[ff e 2*;* f 2*" ) , the coefficient is still insignificant (e.g., has not been reported as 
significant). In such a case, it is obvious that the coefficient will never become 
significant in any of the following bit plane scans and can be treated as zero from now 
on by both encoder and decoder. Furthermore, if a coefficient coef(x,y) is known to 
have been quantized using a quantization step T , its quantized value can be efficiently 
1 5 reconstructed. As soon as the approximated value coef {x, y) falls in a range of type 
(T x (n - 1) , T x (« + 1)) for some «eZ, the approximation takes the value 
coef(x,y) = 7*x n and no more accuracy bits are needed to be transmitted. 



4.2. The Decoding Algorithm 



Obviously, this algorithm is a reversed step of the encoding algorithm of section 
4.1 (previously described), performed in the server 120. The decoding algorithm is 
performed by the client computer 1 10 during the progressive rendering operation 705 
described later in section 5.5. Similar to the encoding algorithm, the decoding 
25 algorithm is described for an image with one component (such as a grayscale image), 
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but of course could also be used with an image with more than one component. The 
input parameters to the decoding algorithm are given below in Table 4: 

Table 4 



Variable 


Meaning 


coef 


Empty matrix of subband coefficients to be filled by the decoding algorithm. 


EqualBinSize 


True for high bit rate/high quality. False for low bit rate/low quality 
(transmitted from the server). 


s - 


The minimal threshold for the component (transmitted from the server). 



10 



4.2.1. Decoding algorithm initialization 

The initialization steps for the decoding algorithm are described below: 

1. Assign to each coefficient t?oef(x y y) the value zero. 



15 



20 



2. Initialize all the coefficients as members of their corresponding Type\6 
group. 

3. Initialize the list of significant coefficients to be empty. 

4. If the "first" data block 



(/ _ x, t _ y, t _ resolution, maxBitPlane(t _ resolution)) 
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is available at the client, read the first byte, which is the value of 
maxBitPlane(tile) . 

5 4.2.2. The outer loop 

The outer loop of the decoding algorithm is described in Figure 16. In steps 
1601-1604, the decoding algorithm scans the bit planes from the given 
b = maxBitPlane{tile) to b = 0 . The input to each such bit plane scan is encoded data 

10 block (t_x,t_y,t_resoIution,b). Since the first stage of the decoding algorithm is 
arithmetic decoding, at the beginning of each scan the arithmetic decoder's input 
module is redirected to read from a slot in the database allocated for the particular data 
block. There are cases where although a subband tile participates in the rendering of the 
ROI, some of its data blocks will be missing. A data block could be missing due to the 

15 following: 

1 It was not requested for the current ROI since it is visually insignificant. 
2. It was requested from the server, but has not arrived yet. 

20 

In any case, the tile's coefficients are required at some accuracy level for the 
rendering and the rule is obviously to use whatever data is present. At the first missing 
data block, the outer loop is terminated. For example, if the first data block is missing, 
all of the coefficients retain their initial value of zero. If only the first few data blocks 
25 are present then only coefficients with "large" absolute values will be reconstructed and 
at a low precision. In step 1604, once the bit plane scan is finished and the data block 
{t _x,t _y,t _resolution,b) has been decoded, the input stream is closed in step 1605, 
and the bit plane is decremented. 
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4.2.3. Bit plane scan 



A pseudo-code of the bit plane scan is provided in Figure 15. The scan, for a 
5 given level b , decodes all of the coefficients' data corresponding to the absolute value 

interval [rf\*2 w ). 



4.2.3.1.1. Significance Scan 

10 The goal of the significance scan is to decode the locations of the coefficients 

that are exposed when we traverse down to the current bit plane. Since the scan is a 
reversed version of the encoding scan described previously in section 4.1.3.1, the 
description below is brief. 

The first loop is on groups of 4x 4 coefficients. If there are no more such 

15 groups, the significance scan is terminated and we proceed to the accuracy update step 
described later in section 4.2.3.2. If the current group is still labeled Type\6 y we 
decode the boolean value b _ Group < b . If b_ Group < b , then the group is currently 
insignificant and we proceed to the next group. Else, the label Type\6 is removed and 
the group is split to 4 subgroups of Type4 . If the group of 1 6 was not labeled Typel6 , 
20 then it must have been split at a higher bit plane. In both cases we must now continue 
to process the four subgroups of four coefficients. The decoder uses the same 
probability models as the encoder for the purpose of arithmetic decoding. 



Groups of 4 . We loop over the subgroups of four coefficients. If a subgroup is 
25 labeled Type4, the value of the test b _ Subgroup <b is arithmetically decoded. The 
arithmetic decoding step is not performed if the following three conditions hold: 
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1. We are now at the fourth subgroup. 

2. The previous three subgroups are still labeled TypeA . 

5 

3. The four subgroups were created just now at the current scan by splitting 
a group of 1 6. 

Recall that in such a case the group is known to be significant, was not encoded 
1 0 and therefore need not be decoded. 

If the current subgroup is (still) insignificant we proceed to the next subgroup. 
Else, the label TypeA is removed, the subgroup is split to the single coefficients and 
each of them is labeled TypeX . If the subgroup was not labeled TypeA , then it must 
15 have been split at a higher bit plane. In both cases we must now continue to handle 
each individual coefficient. 

Sin pie coefficient . We loop over the four coefficients. If a coefficient 
coef(x,y)is still labeled Typel, the value of the test b{x,y) <b is arithmetically 
20 decoded. The arithmetic decoding step is not performed if the following three 

conditions hold: 

1 . We are now at the fourth coefficient. 

25 2 . The previous three coefficients are still labeled TypeX . 

3. The subgroup of the four coefficients was split just now at the current 
scan. 
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Recall that in such a case the coefficient is known to be significant so there is no 
need to decode the significance of the coefficient. 

If the current coefficient is (still) insignificant we proceed to the next 
5 coefficient. Else, the sign of the coefficient is arithmetically decoded, the label Typel 
is removed and the coefficient is added to the list of significant coefficients. At this 
stage the current approximated value coef (x,y) at the decoder is 




10 



which corresponds to the middle of the bit plane interval [e c 2 b , £ c 2 b * 1 ) . 



4.2.3.2. 



Accuracy update 



20 



15 



At the bit plane b , the significant coefficient list contains all the coefficients 
coef (x,y) for which b(x,y) > b at an approximated value. If this step is performed 
(see section 4.L3.2), then for each coefficient in the list, the value of test 
coef (x,y) < coef[x,y) is arithmetically decoded, where coef (x y y) is the actual value 
of the coefficient (not known to the decoder). We then accordingly update the 
approximated value by adding/subtracting from coef (x,y) the amount 
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5. Client workflow 

With reference to Figure 7, the workflow at the client computer 1 10 is now 
5 described. Any new ROI generated by the user's action, such as a zoom-in or a scroll, 
invokes in step 701 a call from the GUI interface to the client imaging module with the 
new ROI view parameters. The client imaging module then computes in step 702 
which data blocks (see section 3) are required for the rendering of the ROI, and checks 
if these data blocks are available in the client cache 111. If not, their coordinate (L2) is 
10 added to a request list ordered according to some progressive mode. The request list is 
then encoded in step 703 and sent to the server computer 120. 

The server computer 120 responds to the request by sending back to the client 
computer 110 a stream of data blocks, in the order in which they were requested. In 
15 step 704, the client computer 110 inserts them to their appropriate location in the 
distributed database. At various points in time step 705, a rendering of the ROI is 
invoked. Naturally, the rendering operation is progressive and is able to use only the 
currently available data in the client's database. 

20 5.1. Step 701 : Receiving the ROI Parameters 

The imaging module on the client computer 120 receives from the GUI 
interface module view parameters detailed in Table 5. These parameters are used in 
step 702 to generate a request list for the ROI. The same parameters are used to render 
25 the ROI in step 705. 
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Variable 


Meaning 


worldPolygon 


A 2D polygon in the original image coordinate system 


scale 


The view resolution. 0 < scale < 1 implies a low view resolution, 
scale = 1 original resolution and scale > 1 a higher than original view 
resolution 


deviceDepth 


A number in the set {8,16,24] representing the depth of the output 
device (screen, printer) 


vipwDunlitit 

VIC- rr \S WUtil \ 


A /-inoliti/ fartnr in tH^ ranop 11 7 1 \A/Vif*rf* 1 lmnlipc vprv lnw mifllltV Jinn 
f\ CJUalllY laLLUf III U1C ldligC 11, / 1 wijciw 1 iilLJJUCd VCiy IUW kjuaiJijr aiiu 

7 implies lossless quality 


luminanceMap 


If active: a curve defining a mapping of medical images with more than 
8 bits (typically 10,12,16 bits) per grayscale values to an 8 bit screen 


progressiveMo 


One of: Progressive By Accuracy, Progressive By Resolution, 
Progressive by Spatial Order 



The basic parameters of a ROI are worldPolygon and scale which determine 
5 uniquely the ROI view. If the ROI is to be rendered onto a viewing device with limited 
resolution, then a worldPolygon containing a large portion of the image will be coupled 
by a small scale . In the case where the rendering is done by a printer, the ROI could 
be a strip of a proof resolution of the original image that has arrived from the server 
computer 120. This strip is rendered in parallel to the transmission, such that the 
10 printing process will terminate with the end of transmission. The other view 

parameters determine the way in which the view will be rendered. The parameters 
deviceDepth and viewOnality determine the quality of the rendering operation. In 
cases the viewing device is of low resolution or the user sets the quality parameter to a 
lower quality, the transfer size can be reduced significantly (see section 5.2). 

15 

The parameter luminanceMap is typically used in medical imaging for 
grayscale images that are of higher resolution than the viewing device. Typically, 
screens display grayscale images using 8 bits, while medical images sometimes 
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represent each pixel using 16 bits. Thus, it is necessary to map the bigger pixel range 
to the smaller range of [0 ? 255] . 

Lastly, the parameter progressiveMode determines the order in which data 
5 blocks should be transmitted from the server 120. The "Progressive By Accuracy" 
mode is the best mode for viewing in low bandwidth environments. "Progressive By 
Resolution" mode is easier to implement since it does not require the more 
sophisticated accuracy (bit plane) management and therefore is commonly found in 
other systems. The superiority of the "progressive by accuracy" mode can be 
10 mathematically proven by showing the superiority of "non-linear approximation" over 
"linear approximation" for the class of real-life images. See, e.g., R. A. DeVore, 
"Nonlinear approximation", Acta Numerica, pp. 51-150, 1998. 

The "Progressive by Spatial Order" mode is designed, for example, for a "print 
1 5 on demand" feature where the ROI is actually a low resolution "proof print" of a high 
resolution graphic art work. In this mode the image data is ordered and received in a 
top to bottom order, such that printing can be done in parallel to the transmission. 

5.2. Step 702: Creating the request list 

20 

In step 702, using the ROI view parameters, the client imaging module at the 
client computer 1 10 calculates data blocks request list ordered according to the 
progressiveMode . Given the parameters worldPolygon and Scale , it may be 
determined which subband tiles (1.1) in the "frequency domain" participate in the 
25 reconstruction of the ROI in the "time domain". These tiles contain all the coefficients 
that are required for an "Inverse Subband/Wavelet Transform" (IWT) step that 
produces the ROI. First, the parameter dyadicResolution(ROl) is computed, which is 
the lowest possible dyadic resolution higher than the resolution of the ROI. Any 
subband tiles of a higher resolution than dyadicResolution(ROl) do not participate in 
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the rendering operation. Their associated data blocks are therefore not requested, since 
they are visually insignificant for the rendering of the ROI. If scale > 1 , then the 
highest resolution subband tiles are required. If scale < 2 s ~ numbcr0fnesolunon3 then only the 
lowest resolution tile is required. For any other value of scale we perform the 
mapping described below in Table 6. 



Table 6 



10 



15 



20 



25 



scale 


highestSubbandResolution 


scale < 2}~ numi>crG fo eioiutlons 


1 


2l-»umbcrOfRaolurioiu ^ S qqIq < J 


numberOfResolutions -\- log, (scale) J 


scale > 1 


numberOfResolutions 



Once it has been determined which subband tiles participate in the rendering of 
the ROI, it is necessary to find which of their data blocks are visually significant and in 
what order they should be requested. Using well known rate/distortion rules from the 
field of image coding (such as is described in S. Mallat and F. Falzon, "Understanding 
image transform codes", Proc. SPIE Aerospace Conf., 1997), it is not too difficult to 
determine an optimal order in which the data blocks should be ordered by the client 
imaging module (and thus delivered by the server 120). This optimal order is described 
in steps 601-610 of Figure 6 for the "Progressive By Accuracy" mode. The underlying 
mathematical principal behind this approach is "Non-Linear Approximation". 

First, the subband coefficients with largest absolute values are requested since 
they represent the most visually significant data such as strong edges in the image. 
Notice that high resolution coefficients with large absolute value are requested before 
low resolution coefficients with smaller absolute value. Within each given layer of 
precision (bit plane) the order of request is according to resolution; low resolution 
coefficients are requested first and the coefficients of highestSubbandResolution are 
requested last. 
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The main difficulty of this step is this: Assume a subband tile (1.1) is required 
for the rendering of the ROI. This means that 

i resolution < dyadicResolution (ROI) and the tile is required in the IWT procedure 
5 that reconstructs the ROI. It must be understood which of the data blocks (1.2) 

associated with the subband tile represent visually insignificant data and thus should 
not be requested. Sending all of the associated data blocks will not affect the quality of 
the progressive rendering. However, in many cases transmitting the "tail" of data 
blocks associated with high precision is unnecessary since it will be visually 
1 0 insignificant. In such a case, the user will see that the transmission of the ROI from the 
server 120 is still in progress, yet the progressive rendering of the ROI seems to no 
longer to change the displayed image. 



Some examples and notes follow: 



15 



20 



1. 



Assume the ROI is a low resolution rendering of the full image at the 
resolution n . Thus, all the subband coefficients at the subband 
resolutions 1 < j < n participate in the IWT of the ROI. But only the 
very large coefficients of these subbands are visually significant for a 
rendering of this ROI, even at very high qualities. Also these large 
coefficients are only required at a very low precision. Thus, we can 
request from the server 120 only the first few data blocks associated 
with each subband tile at resolutions < n , since they correspond to the 
highest bit planes. 



25 



2. 



Assume the ROI is at the highest resolution of the image. In a normal 
setting this would mean that for any subband tile participating in the 
reconstruction of the ROI, all of the associated data blocks should be 
requested, such that the rendering will be of a very high (or even 
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lossless) quality. But if the parameter deviceDepth (see Table 5) is set 
at low resolution (e.g., the viewing device is of low resolution) or the 
parameter viewQuality is set at low quality (e.g., the user is currently 
not interested in high quality rendering), the last data blocks of each 
subband tile corresponding to high precision need not be ordered. 

The method to determine which of the first data blocks are required is as 
follows. For each subband tile (r _y,t _resolution) participating 
in the reconstruction of the ROI, we initialize the set of requested data 
blocks to the "maximal" set: 

(/ _ x, t _ y, t _ resolution^ _ bitPlane) 
minRequestPlane < t _ bitPlane < maxBitPlane(t _ resolution) 

with minRequestPlane = 0 . Recall that at this point the client 120 does 
not know the amount of data each absent data block contains. In many 
cases requested data blocks will turn out to be empty. 

20 1 . We remove from the data set all the data blocks 

(/ _x,t _y,t _ resolution^ ^bitPlane) such that 

/ _ bitPlane < numberOfResolutions - dyadicResolution (ROI) 

25 That is, one data block/bit plane, for each lower resolution. We update 

minRequestPlane accordingly. This simple rule of thumb that the lower 
the resolution the less precision required originates from the way 
subband coefficients are scaled according to their resolution. 



10 
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10 



15 



2. Whenever deviceDepth = 8 , the output device is of low resolution. In such 
a case the client can request less precision. We decrement one bit plane and 
set 



min 



RequestPlane = min(maxBitPlane{t _ resolution) , minRequestPlane + 1) 



If viewQuality is set at a lower viewing quality, we further remove high 
precision data blocks from the set. Again, the rule is a removal of one 
bit plane per lower quality. 



4. For subband tiles at dyadicResolution{ROl) , we perform an extra data 
reduction step, an interpolation between the precision layers: 
minRequestPlane and maxBitPlane(t Resolution) . The underlying idea 
of the interpolation is this: if -log 3 (scale) = dyadicResolution(ROl) then 
the ROI is exactly at the dyadic resolution dyadicResolution(ROl) and the 
parameter minRequestPlane should not be updated. On the other hand, if 
-log 2 (scale) = dyadicResolution(ROl)-\ then the ROI is at a lower 
20 dyadic resolution than the subband tile and non of the data blocks should be 

ordered. Thus, for any scale between these two values, we take a linear 
interpolation between these two extremes. This technique leads to a unique 
feature of continuous data streaming control. In cases where the user 
performs a sequence of zooms with small increments, a little additional data 
25 is ordered and rendered each time. Thus, although the system is using a 

multiresolution representation of the image discretized to dyadic resolutions, 
the fourth accuracy coordinate makes the representation more "continuous". 



BNSDOCID: «rWO 0116764A1J_> 



WO 01/16764 



PCT/US00/23479 



37 

5.3. Step 703: Encoding the request list 

In step 703, the client imaging module in the client computer 1 10 encodes the 
5 request list into a request stream which is sent to the server computer 120 via the 
communication network 130 (see Figure 1). The request list encoding algorithm is a 
simple rectangle-based procedure. The heuristics of the algorithm is that the requested 
data block usually can be grouped into data block rectangles. From the request list of 
data blocks indexed by (1.2) the encoding algorithm computes structures of the type 

10 

{(' - ■*» ' _ ^ ' _ resolution, t _ bitPlane) , n s , n v ] , n x , n y > 1 

(1.3) 

Each such structure represents the n x x n data blocks 
15 {(( _x + i,t _y + j,t ^resolution,! ^bitPlane)} , 0<i<n x ,0< j <n y 

The encoding algorithm attempts to create the shortest possible list of structures 
of type (1.3) which can be done by collecting the data blocks to the largest possible 
rectangles. It is important to note that the algorithm ensures the order of data blocks in 

20 the request list is not changed, since the server 120 will respond to the request stream 
by transmitting data blocks in the order in which they were requested. A good example 
of when this works well is when a user zooms in into a ROI at a high resolution that 
was never viewed before. In such a case the request list might be composed of 
hundreds of requested data blocks, but they will be collected to one (x,y) rectangle 

25 (1 .3) for each pair (/ _ resolution, t _ bitPlane) . 
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5-4. Step 704: Receiving data blocks 

In step 704, the client computer 110 upon receiving from the server computer 
5 120 an encoded stream containing data blocks, decodes the stream and inserts the data 
blocks into their appropriate location in the distributed database using their ID as a key. 
The simple decoding algorithm performed here is a reversed step of the encoding 
described later in section 6.5. Since the client 1 10 is aware of the order of the data 
blocks in the encoded stream, only the size of each data block need be reported along 
10 with the actual data. In case the server 120 informs of an empty data block, the 
receiving module marks the appropriate slot in the database as existing but empty. 

Recall that the subband tile associated with each data block is denoted by the 
first three coordinates of the four coordinates of a data block (t_x 9 t_y,t ^resolution) . 

15 From the subband tile's coordinates the dimensions are calculated of the area of visual 
significance; that is, the portion of the ROI that is affected by the subband tile. Assume 
that each subband tile is of length tileLength and that the wavelet basis used has a 
maximal filter size maxFilterSize , then defining hFilterSize-\ maxFilterSize/l) and 
factor := numberOJResolutions -t _ resolution + 1 , we have that the dimensions of the 

20 affected region of the ROI (in the original image's coordinate system) are 

[/ _ x x tilelength /octor - hFilterSize fac,or , {t _ x + 1) x tiIeLength /ttC ' or + hFilterSize /aaor ] x 
[t _y x tileLength farlor - hFilterSize facior , (r _ y + 1) x tileLength faaor + hFilterSize factor ] 

These dimensions are merged into the next rendering operation's region. The 
25 rendering region is used to efficiently render only the updated portion of the ROI. 
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5*5. Step 705: Progressive Rendering 

During the transmission of ROI data from the server 120 to the client 110, the 
5 client 110 performs rendering operations of the ROI. To ensure that these rendering 
tasks do not interrupt the transfer, the client may run two program threads: 
communications and rendering. The rendering thread runs in the background and 
draws into a pre-allocated "off-screen" buffer. Only then does the client 110 use device 
and system dependant tools to output the visual information from the off-screen to the 
10 rendering device such as the screen or printer. 

The rendering algorithm performs reconstruction of the ROI at the highest 
possible quality based on the available data at the client 110. That is, data that was 
previously cached or data that just arrived from the server 120. For efficiency, the 
15 progressive rendering is performed only for the portion of the ROI that is affected by 
newly arrived data; specifically, data that arrived after the previous rendering task 
began. This updated region is obtained using the method of step 704, described 
previously in section 5.4. 

20 The parameters of the rendering algorithm are composed of two sets: 



1 . The ROI parameters described in Table 5. 

2. The parameters transmitted from the server 120 explained in Table 3, 
with the exception of the jumpSize parameter, which is a server-only 

25 parameter. 



The rendering algorithm computes pixels at the dyadic resolution 
dyadicResolution(ROl) . Recall that this is the lowest possible dyadic resolution 
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which is higher than the resolution of the ROL The obtained image is then resized to 
the correct resolution. 

Using a tiling of the multiresolution representation of the ROI, the steps of the 
5 algorithm are performed on a tile by tile basis as described in Figure 22. Since the 
tiles' length are UleLength , which is typically chosen as 64, the rendering algorithm is 
" memory efficient. This is especially important in printing mode, when the rendered 
ROI may actually be a proof print (low) resolution of the full image. 

10 5.5.1. The rendering rate 

As ROI data is transmitted to the client 110, the rendering algorithm is 
performed at certain time intervals of a few seconds. At each point in time, only one 
rendering task is performed for any given displayed image. To ensure that progressive 

15 rendering does not become a bottleneck, two rates are measured: the data block transfer 
rate and the ROI rendering speed. If it predicted that the transfer will be finished 
before a rendering task, a small delay is inserted, such that rendering will be performed 
after all the data arrives. Therefore, in a slow network scenario (as the Internet often 
is), for almost all of the progressive rendering tasks, no delay is inserted. With the 

20 arrival of every few kilobytes of data, containing the information of a few data blocks, 
a rendering task visualizes the ROI at the best possible quality. In such a case the user 
is aware that the bottleneck of the ROI rendering is the slow network and has the option 
to accept the current rendering as a good enough approximation of the image and not 
wait for all the data to arrive. 



25 



5.5.2. Memory constraint subband data structure 



This data-structure is required to efficiently store subband coefficients, in 
memory, during the rendering algorithm. This is required since the coefficients are 
30 represented in long integer (lossless coding mode) or floating-point (lossy coding 
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mode) precision which typically require more memory than pixel representation (1 
byte). In lossy mode, the coefficients at the client side 1 10 are represented using 
floating-point representation, even if they were computed at the server side 120 using 
an integer implementation. This will minimize round-off errors. 

5 

At the beginning of the rendering algorithm, coefficient and pixel memory strips 
are initialized. dyadicWidth (ROl) may be denoted as the width of the projection of the 
ROl onto the resolution dyadicResolution (ROl) . For each component and resolution 
1 < j < dyadicResolution(ROl) , three subband coefficient strips are allocated for the 
10 three types of subband coefficients: hl,hl y hh. The coefficient strips are allocated with 
dimensions 



2 y-^w^(,o/ H x dyadicWidth (ROl)lx tileLength + maxFa ^ S ^ 



15 For each component and resolution 1 < j < dyadicResolution a pixel strip is 

allocated with dimensions 

maxFilterSize 



2J -^*cR<soiu«o»iRon x dyadicWidth (ROl) i tileLength+- 



Beginning with the lowest resolution 1, the algorithm proceeds with a recursive 
20 multiresolution march from the top of the ROl to bottom (y direction). The pseudo- 
code for this recursive march is provided in Figure 23. Referring to Figures 21 and 22, 
in step 2101, the multiresolution strips are filled with sub-tiles of coefficients 2150 
decoded from the database or read from the memory cache. From the coefficients we 
obtain multiresolution pixels 2151 using an inverse subband transform step 2102 
25 (shown in further detail in Figure 21). Each time a tile of pixels at resolutions 

j < dyadicResolution (ROl) is reconstructed, it is written into the pixel strip at the 
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resolution j . Each time a tile of pixels at the highest resolution 
dvadicResolution{ROl) is reconstructed, it is fed into the inverse color transform and 
resizing steps 2 1 03, 2 1 04. 

5 5.5.3. Step 2101 : decoding and memory caching 

The subband coefficients data structure described previously in section 5.5.2 is 
filled on a tile basis. Each such subband tile is obtained by decoding the corresponding 
data blocks stored in the database or reading from the memory cache. The memory 

10 cache is used to store coefficients in a simple encoded format. The motivation is this: 
the decoding algorithm described previously in section 4.2 is computationally intensive 
and thus should be avoided whenever possible. To this end the rendering module uses 
a memory cache 1 1 1 where subband coefficient are stored in very simple encoded 
format which decodes very fast. For each required subband tile, the following 

15 extraction procedure is performed, described in Figure 17, beginning at step 1701. In 
step 1702, if no data blocks are available in the database for the subband tile, its 
coefficients are set to zero (step 1703). In step 1704, if the tile's memory cache storage 
is updated, namely it stores coefficients in the same precision as in the database, then 
the coefficients can be efficiently read from there (step 1705). In step 1706, the last 

20 possibility is that the database holds the subband tile in higher precision. Then, the tile 
is decoded down to the lowest available bit plane using the algorithm previously 
described in section 4.2 and the cached representation is replaced with a representation 
of this higher precision information. 

25 5.5.4. Step 2102: inverse subband transform 

This is an inverse step to step 904 performed in the server 120 (see section 
6.1.6, described later). As shown in Figure 20, four extended subband coefficient sub- 
tiles of length tileLength/2 + maxFUterSize at the resolution / (shown as reference 
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numeral 2001) are read from the coefficient strips data structure and transformed to a 
tile of pixels (reference numeral 2003) at the next higher resolution using 

subbandTransformType{j) (reference numeral 2002). If 

j + \ < dyadicResolution(ROl) , the tile of pixels obtained by this step is inserted into 

5 the pixel memory strip at the resolution y + 1 . If j + 1 = dyadicResolution (ROI) the 
tile of pixels is processed by the next step of color transform. 

5.5.5. Step 2103: inverse color transform 

10 This is an inverse step to step 903 performed at the server 120 (see §6.1.5, 

described later). It is performed only for tiles of pixels at the resolution 
highestSubbandResolution . At this stage, all of the pixels of each such tile are in the 
outpuiColorSpace and so need to be transformed into a displaying or printing color 
space. For example, if the original image at the server 120 is a color image in the color 
15 space RGB, the pixels obtained by the previous step of inverse subband transform are 
in the compression color space YIQ. For viewing they must be transformed back to 
RGB representation. If the ROI is at an exact dyadic resolution, then the tile of pixels 
in the rendering color space is written into the off-screen buffer. Else it is resized in the 
next step. 

20 

5.5.6. Step 2104; image resize 

In case the resolution of the ROI is not an exact dyadic resolution, the image 
obtained by the previous step must be re-sized to this resolution. This can be 
25 accomplished using operating system imaging functionality. In most cases the 

operating system's implementation is sub-sampling which produces in many cases an 
aliasing effect which is visually not pleasing. To provide higher visual quality, the 
imaging system of the present invention may use the method of linear interpolation, for 
example described in J. Proakis and D. Manolakis, ''Digital signal processing", Prentice 
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Hall, 1 996. The output of the interpolation is written to the off-screen buffer. From 
there it is displayed on the screen using system device dependant methods. 

6. Server workflow 

5 

With reference to Figure 8, the operation of the server computer 120 (Figure 1) 
will now be described. Initially, an uncompressed digital image is stored in, for 
example, storage 122 of the server computer 120. This uncompressed digital image 
may be a two-dimensional image, stored with a selected resolution and depth. For 
10 example, in the medical field, the uncompressed digital image may be contained in a 
DICOM file. In the graphic arts field, the uncompressed image may be, for example, in 
the Tiff standard format or a result of a RIP (Raster Image Processing) algorithm 
converting a postscript file to a raster image. 

15 Once the client computer 1 10 requests to view or print a certain image, the 

server 120 performs the preprocessing step 801 . This step is a computation performed 
on data read from the original digital image. The results of the computation are stored 
in the server cache device 121. After this fast computation, a ready to serve message is 
sent from the server 120 to the client 1 10 containing basic information on the image. 

20 

In step 802, the server 120 receives an encoded stream of requests for data 
blocks associated with a ROI that needs to be rendered at the client 110. The server 120 
then decodes the request stream and extracts the request list. 

25 In step 803, the server 120 reads from cache 121 or encodes data block 

associated with low resolution portions of the ROI, using the cached result of the 
preprocessing stage 80 1 . 

If the ROI is a high resolution portion of the image, the server 120, in step 804, 
30 reads from cache 1 2 1 or performs a "local" and efficient version of the preprocessing 
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step 801. Specifically, a local portion of the uncompressed image, associated with the 
ROI, is read from the storage 122, processed and encoded. Data encoded in steps 803- 
804 is progressively sent to the client 1 10 in the order it was requested. 

5 6.1. Step 801: preprocessing 

The preprocessing step is now described with respect to Figure 9. The 
preprocessing algorithm's goal is to provide the fastest response to the user's request to 
interact with the image. Once this fast computational step is performed, the server 120 

10 is able to provide efficient Pixel-On-Demand™ transmission of any client ROI requests 
that will follow. In most cases the first ROI is a view of the full image at the highest 
resolution that fits the viewing device. The preprocessing algorithm begins with a 
request for an uncompressed image that has not been processed before or has been 
processed but the result of this previous computation has been deleted from the cache 

15 121. As explained, this unique algorithm replaces the possibly simpler, yet less 
effective, procedure of encoding the full image into some progressive format. This 
latter technique will provide a much slower response to the user's initial request then 
the present technique described below. 

20 At the end of the algorithm a ready to serve ROI of the image message is sent to 

the client 110 containing basic information on the image. While some of this 
information, image dimensions, original color space, resolution etc., is available to the 
user of the client computer 1 10, most of this information is internal and required by the 
client 1 10 to formulate ROI request lists (see section 5.2, described previously) and 

25 progressively render (see section 5.5). 

Next the preprocessing algorithm is described in detail. 
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6.1-1. Preprocessing Parameters 



Table 7 



Variable 


Meaning 


losslessMode 


Tf tni*» nrpnmrpcdna will nreDare data that can be used for 

[ I p leu lUCCo olil Will pi \* v****.** w * 

lossless transmission. 


subbandTransform Typ 


The framework allows the ability to select a different subband 
transform for each resolution of each image. The technical term 
is: non-stationary transforms. 


numberOJResolutions 


The number of resolutions in the Multiresolution structure 
calculated for the image. 

Typically, numberOJResolutions = log 2 {^ImageSize} . 


jumpSize 


A number in the range [Q, numberOJResolutions -l] . The 
preprocessing stage computes only the top lower part of the 
image's multiresolution pyramid of the 
size numberOfResolutions - jumpSize . 


tileLength 


Twiimllv = 64 Tradeoff between time and coding performance. 


nTilesX(j) 


Number of subband tiles in the x direction at the resolution j 


nTilesX(j) 


Number of subband tiles in the y direction at the resolution j 


inputColorSpace 


Color space of uncompressed original image. 


outputColorSpace 


Transmitted color space used as part of the encoding technique. 


numberOfComponents 


Number of components in OutputColorSpace . 


threshold {cj) 


A matrix of values used in lossy compression. The subband 
coefficients of the component cat the resolution j with absolute 
value below threshold [cj) are considered as (visually) 
insienificant and set to zero. 



Given an input image, the parameters described in Table 7 are computed 
chosen. These parameters are also written into a header sent to the client 1 10 an 



RNSnoniD: <WO 0H 6764A1 1 > 



WO 01/16764 



PCT/US00/23479 



47 

used during the progressive rendering step 705 (see section 5.5, described previously). 
The important parameters to select are: 

1 . losslessMode : In this mode, progressive transmission of images takes 
place until lossless quality is obtained. Choosing this mode requires the 
preprocessing algorithm to use certain reversible wavelet transforms, 
and can slow down the algorithm. 

2. sitbbandTransformType(j) : The (dynamic) selection of wavelet basis 
(as described, for example, in I. Daubechies, "Ten lectures on wavelets", 
Siam, 1992) is crucial to the performance of the imaging system. The 
selection can be non-stationary, meaning a different transform for each 
resolution j . The selection is derived from the following 
considerations: 

a. Coding performance (in a rate/distortion sense): This is 
obviously required from any decent subband/wavelet transform. 

b. Approximation of ideal low pass: It is favorable to choose a 
transform such that low resolutions of the image will be of high 
visual quality (some filters produce poor quality low resolutions 
even before any compression takes place). 

c. Fast transform implementation: Can the associated fast transform 
be implemented using lifting steps (as described, for example, by 
I. Daubechies and W. Sweldens, "Factoring wavelet transforms 
into lifting steps" J. Fourier Anal. Appl., Vol. 4, No. 3, pp. 247- 
269, 1998), using only integer shifts and additions, etc. Some 
good examples are the Haar and CDF transforms (1,3), (2,2) 
described in I. Daubechies, "Ten lectures on wavelets", Siam, 
1992. 
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d. Fast low pass implementation: A very important parameter,' 
since together with the parameter jump Size, it determines 
almost all of the complexity of the algorithm. For example, the 
CDF (1,3) is in this respect the "optimal" transform with three 

5 vanishing moments. Since the dual scaling function is the simple 

B-spline of order 1, its low pass is simple averaging. Thus, the 
sequence of CDF transforms, using the B-spline of order 1 as the 
dual scaling function, but with wavelets with increasing number 
of vanishing moments are in some sense optimal in the present 
10 system. They provide a framework for both real time response 

and good coding efficiency. 

e. Lossless mode: If loss lessMode is true we must choose the 
filters from a subclass of reversible transforms (see, for example, 
"Wavelet transforms that map integers to integers", A. 

15 Calderbank, I. Daubechies, W. Sweldens, B. L. Yeo, J. Fourier 

Anal. Appl., 1998). 

f. Low system I/O: If the network 130 in Figure 1 connecting 
between the Image residing on the storage 122 and the imaging 
server 120 is slow, the bottleneck of the preprocessing stage (and 

20 the whole imaging system for that fact) might be simply the 

reading of the original image. In such a case a transform may be 
chosen with a lazy sub-sampling low pass filter that corresponds 
to efficient selective reading of the input image. Many 
interpolating subband transforms with increasing number of 

25 vanishing moments can be selected to suit this requirement. 

However, this choice should be avoided whenever possible, since 
it conflicts with (a) and (b). 

g. Image type: If the type of the image is known in advance, an 
appropriate transform can be chosen to increase coding 

30 efficiency. For example: Haar wavelet for graphic images, 



BNSOOCIO: <WO 0116764A1J_> 



WO 01/16764 



PCT/USOO/23479 



49 



smoother wavelet for real-life images, etc. In the graphic arts 
field, there are numerous cases of documents composed of low 
resolution real-life images and high resolution graphic content. 
In such a case, a non-stationary sequence of transforms may be 
5 chosen: Haar for the high resolutions and a smoother basis 

starting at the highest resolution of a real-life image part. In case 
of low system I/O (f), a non-stationary choice of interpolating 
transforms of different orders is required. 

10 3. jumpSize : This parameter controls the tradeoff between fast response 

to the user's initial request to interact with the image and response times 
to subsequent ROI requests. When jumpSize is large, the initial 
response is faster, but each computation of a region of interest with 
higher resolution than the jump might require processing of a large 

15 portion of the original image. 

4. outputColorSpace : The following are color spaces which perform well 
in coding: 

a. Grayscale: For grayscale images 

b. YIQ: for viewing color images 

20 c. YIQK: for printing CMYK images 

d. Lab: For both viewing and printing, and assuming a color 

management system exists at the server 120 and client 110 side. 

For example, transforming an RGB to the YIQ color space 
25 provides a coding gain of 50%, according to A. Drukarev, 

"Compression related properties of color spaces", Proc. SPIE 
Vol. 3024, 1997 and N. Moroney and M. Fairchild, "Color space 
selection for JPEG image compression", J. Elec. Imaging, Vol. 
4(4), pp. 373-381, 1995. 
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10 



5. 



threshold {c,j) : These parameters control the visual quality in case of 
lossy compression. The smaller the thresholds, the higher the visual 
quality. Naturally, the tradeoff is quality for bit-rate. A typical choice 
for visually lossless quality in the case of a YIQ output color space is 
described in Table 8 below 

Table 8 



Resolution / Component 


Y 


I 


Q 


j = numberOfResolutions 


6 


12 


12 


j < numberOfResolutions 


4 


8 


8 



The choice is also derived from the mathematical properties of the pre-chosen 
subband transform(s). 

1 5 By padding (adding sufficient rows and columns) each resolution j , an exact 

tiling is obtained of the multiresolution structure with {nTilesX(j),nTilesY(j)} tiles 
for each resolution j , each of length tileLength . 



20 



6.1.2. Memory constraint multiresolution scan data structure 



Most wavelet coding algorithms have not addressed the problem of memory 
complexity. Usually the authors have assumed there is sufficient memory such that the 
image can be transformed in memory from the time domain to a wavelet frequency 
domain representation. It seems the upcoming JPEG2000 will address this issue, as did 
25 its predecessor JPEG. The preprocessing algorithm also requires performing subband 
transforms on large images, although not always on the full image, and thus requires 
careful memory management. This means that the memory usage of the algorithm is 
not of the order of magnitude of the original image, as described in J. M. Shapiro,"An 
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embedded hierarchical image coder using zero-trees of wavelet coefficients", IEEE 
Trans. Sig. Proc. ; Vol. 41, No. 12, pp. 3445-3462, 1993. 

Given an uncompressed image we allocate the following number of memory 

5 strips 

numberOfComponents x [numberO/Resolutions - jumpSize) 



^ 2 ^« um ^«o;w^.y, //n ^ e ^ r/i>3 x tUeLength 1 2 + maxFilterSize] 

for 1< j < numberO/Resolutions - jumpSize -I and 

[image Width, tileLength + 2 x maxFilterSize] 

for j - numberO/Resolutions - jumpSize 

That is, the memory usage is proportional to 2~ JttmpS,zc x imageWidth . Each such 
strip stores low-pass coefficients in the color space outputColorSpace at various 
resolutions. 

A snapshot visualization of the multiresolution data structure during the 
preprocessing algorithm is provided in Figure 18. Referring to Figure 9, during the 
preprocessing stage, the resolutions are scanned simultaneously from start to end in the 
y direction. For each color component and resolution, the corresponding strip (1801 in 
Figure 18) stores low-pass coefficients or pixels at that resolution. The core of the 
preprocessing algorithm are steps 904-907, where tiles of pixels of length 
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tileLength + 2 x maxFilterSize are read from the memory strips and handled one at a 
time. In step 904 the tile is transformed into a tile of length tileLength containing two 
types of coefficient data: subband coefficients and pixels at a lower resolution. The 
subband coefficients are processed in steps 905-906 and are stored in the cache. The 
5 lower resolution pixels are inserted in step 907 to a lower resolution memory strip. 
Whenever such a new sub-tile of lower resolution pixels (see section 6. 1 .9) is inserted 
.dnto a strip, the algorithm's memory management module performs the following 
check: If the part of the sub-tile exceeds the current virtual boundaries of the strip, the 
corresponding first lines of the strip are considered unnecessary anymore. Their 
1 0 memory is (virtually) re-allocated for the purpose of storing the sub-tile's new data. 
The pseudo-code of the memory constraint scan algorithm is detailed in Figure 1 9. 

6.1.3. Step 901: Non-Linear Color Transform 

15 The first step, step 901 is optional. It is only performed in cases where the color 

conversion from the original uncompressed color space to the compressed color space 
is a non-linear transformation. A good example is CMYK -» YIQ color conversion. 
The output of this step, which are pixels in the color space oatputColorSpace , are not 
stored, but fed directly to the next step 902. 



20 



6.1.4. Step 902: (Subband) Low Pass 



numt 



In step 902, the low pass filters of the transforms subbandTransformType(j) , 
iberOJResolutions-jumpSize <j< numberOJResolutions , are used to obtain a low 
25 resolution strip at the resolution numberOJKesohitions ■ jumpSize (as can be seen in 
Figure 10). Typically, it is required to low pass about lines of the original 

image 1010 to produce one line of the low resolution image. The low pass calculation 
is initiated by a read of tiles of pixels from the memory strips performed in step-904. , 
Whenever there is an attempt to read missing low resolution lines, they are computed 
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by low passing the original image and inserted into the memory strip. As explained in 
section 6.1.2, the insertion over-writes lines are no longer required, such that the 
algorithm is memory constrained. In the case where a non-linear color transform took 
place in the previous step 901, the results of that transform are low-passed. By jumping 
5 up the multiresolution ladder (jump 1001 in Figure 10) to resolution 101 1, we avoid 
computation and storage of high resolution subband coefficients. 

For example by choosing jumpSize = 2 , the main preprocessing stages: (linear) 
color transform, subband transform, quantization and storing will be done only for a 

10 low resolution image that is 16 times smaller than the original image. In case a user 
views a higher resolution ROI, the server 120 will have to read (again) the associated 
local area of the original image and perform in step 804 more computations (see section 
6.4). Choosing JumpSize = 2 means that the length of this local area is bounded by 
2xmax(widthOJDisplay J heightOJDisplay) , which is typically = l,400pixels and 

15 small enough for real-time computation. Choosing a bigger jump leads to faster 

response time assuming the original image can be read from the disk fast enough, but 
might cause more computations later on. 

In a lossy mode, the low pass filtering can be implemented efficiently in integer 
20 without paying much attention to round-off errors. In lossless mode 

(losslessMode = true ), care must be taken to ensure that the result of the low pass step, 
which are low resolution pixels, can be used in a lossless framework. Specifically, it 
must be ensured that in a scenario where a user views a low resolution ROI and then 
zooms into a high resolution ROI, the low resolution data already present at the client 
25 side can be used for the rendering such that only the difference between the low 

resolution and high resolution need to be progressively transmitted until lossless quality 
is reached. Thus, the low pass is implemented using a special lossless integer-to- 
integer transform technique, such as that described in "Wavelet transforms that map 
integers to integers", A. Calderbank, I. Daubechies, W. Sweldens, B. L. Yeo, J. Fourier 
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Anal. Appl., 1998. Unfortunately, this means that this step is slower in lossless mode, 
due to this special implementation. Nevertheless, it is one of the only ways to provide a 
fast response to the user's first request to view the image in a true lossless efficient 
imaging system. 

5 

Other less sophisticated solutions include: 

1 . Transmitting a lossy representation of the image first and a lossless high 
resolution representation when required, without using the lossy data 

10 already sent. 

2. Transmitting a lossy representation of the image first and a difference 
between the lossy representation and the original image whenever 
lossless quality is required. To compute this difference, the server 120 

1 5 must simulate the client side 1 1 0 and reconstruct the lossy 

representation. 



6.1.5. Step 903: Linear color transform 

20 Step 903 is performed whenever the color transform from inputColorSpace to 

outputColorSpace is a linear operation. In such a case, the color transform is 
postponed to this stage of the algorithm, although traditionally this is the first step in 
any transform coding algorithm. The reason for this is that, since the computation of 
the low pass of step 902 and a linear color transform are commutative, it is much more 

25 efficient to first perform low pass filtering in the InputColorSpace domain and only 
then perfonn the linear color transform on less pixels. 



BNSDOCID: <WO Ot 16?64A1_I_> 



WO 01/16764 



PCT/USOO/23479 



55 

6.1.6. Step 904: The subband transform 

In step 904, one step of an efficient local subband transform (FWT) is 
5 performed on a tile of pixels at the resolution 1 < j < numberOfResolutions - jumpSize . 
The type of transform is determined by the parameter subbandTransformType(j) . As 
described in Figure 20, the transform is performed on an extended tile of pixels of 
length tileLength + 2 x maxFilterSize (unless at the boundaries) and is read directly 
from a multiresolution strip at the resolution j + 1 . The output of the step is a subband 

10 tile (1.2) of length tileLength composed of subband/wavelet coefficients and low 
resolution coefficients/pixels. The transform step can be efficiently implemented in 
integers. This saves converting the original pixels to floating-point representation and 
facilitates the next quantization step. Although the imaging system, as described, can 
work with any subband/wavelet transform and any implementation, an efficient 

1 5 implementation of a carefully selected transform should have the following properties: 

Integer implementation using only additions and shifts. 

Efficient "Lifting Steps" factorizations as explained in I. Daubechies and 
W. Sweldens, "Factoring wavelet transforms into lifting steps", J. 
Fourier Anal. AppL, Vol. 4, No. 3, pp. 247-269, 1998. 

Implementation of the two tensor-product steps done at once (see I. 
Daubechies, "Ten lectures on wavelets", Siam, 1992). This replaces the 
traditional implementation of one-dimensional transforms first on rows 
and then on columns and leads to a better memory management and is 
faster. 
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4. The known univariate scaling factors of size l/ yfl are not implemented. 
Instead, since a tensor 2D transform is performed, they can be collected 
to a single scaling factor of 2 and thus implemented using a shift. With 
careful implementation, even this scaling operation can be avoided by 
inserting it to the last lifting step. 

: 5. In case an acceleration processor that supports parallel low level 

computations is present in the server computer 120, it can be used to 
accelerate this step by paralleling a few lifting Steps. 

The subband transform of step 904 outputs two types of coefficients: scaling 
function (low pass) and wavelet (high pass). The wavelet coefficients are treated in 
steps 905-906 while the scaling function coefficients are treated in step 907. 

1 5 Note that tiles of pixels which are located on the boundaries sometimes need to 

be padded by extra rows and/or columns of pixels, such that they will formulate a full 
tile of length tileLength . 



10 



20 



6.1.7. Step 905: Quantization of subband coefficients 



In step 905, unless losslessMode is true, the subband coefficients calculated in 
step 904 are quantized. This procedure is performed at this time for the following 
reason: It is required that the coefficients computed in the previous step will be stored 
in the cache 121. To avoid writing huge amounts of data to the cache, some 
25 compression is required. Thus, the quantization step serves as a preparation step for the 
next variable length encoding step. It is important to point out that the quantization 
step has no effect on compression results. Namely, the quantization step is 
synchronized with the encoding algorithm described previously in section 4.1, such that 
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the results of the encoding algorithm of quantized and non-quantized coefficients are 
identical. 

A tile of an image component c at the resolution j is quantized using the given 
5 threshold threshold (c,j) : for each coefficients x , the quantized value is 

{xj threshold {c y j)\ . It is advantageous to choose the parameters threshold (cj) to be 
dyadic such that the quantization can be implemented using integer shifts. The 
quantization procedure performed on a subband tile is as follows: 

10 1 . Initialize maxBitPlane ( rife) = 0 . 

2. Loop over each group of four coefficients 

[coef (2 x i + .x, 2 x j + y)} q ( . For each such group initialize a variable 

length parameter length (/, j ) = 0 . 

15 

3. Quantize each coefficient in the group coef (2 x / + x, 2 x j + y) using the 
appropriate threshold. 

4. For each coefficient, update length (i 9 j) by the bit plane b of the 
20 coefficient, where the bit plane is defined by 



\coe\ 



rf (2 x / + x, 2 x j + y)\ € [2* threshold (c, j) , 2* +l threshold (c, j)) 



5. After processing the group of four coefficients, use the final value of 
25 length to update maxBitPlane(tile) by 
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maxBitPlane(tile) = max(maxBitPlane (tile) , length (i, jj) 

6 At the end of the quantization step, store the value maxBitPlane(tile) in 
the cache 121. 

5 

Note that for subband tiles located at the boundaries we can set to zero subband 
coefficients that are not associated with the actual image, but only with a padded 
portion. To do this we take into account the amount of padding and the parameter 
maxFiherSize . The motivation for the "removal" of these coefficients is coding 
10 efficiency. 

6.1.8. Step 906: Variable Length encoding and storage 

In step 906 the coefficients that were quantized in the previous step are variable 
15 length encoded and stored in the cache 121. If maxBitPlane(tile) = Owe do not write 
any data. Else we loop on the coefficient groups { coef (2 x / + x, 2 x j + y)} iym0 i • For 
each such group we first write the group's variable length length(ij) using 
log, (maxBUPlcme(tile)) bits. Then for each coefficient in the group we write 
length(ij) + \ bits representing the coefficient's value. The least significant bit 
20 represents the coefficient's sign: if it is 1 then the variable length encoded coefficient is 
assumed to be negative. 

6.1.9. Step 907: Copying low pass coefficients into the 
multiresolution strip structure 



25 



In step 907, the low pass coefficients are inserted into the pyramidal strip data 
structure at the appropriate location. If the length of the subband tile is tileLength , 
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15 



20 



then the length of the low pass sub-tile is tileLengthjl . If the coordinates of the tile 



As explained in section 6.1.2, these coefficients will be merged into a bigger tile of low 
resolution pixels and processed later. 

6.2. Step 802: Decoding the request stream 

This is the inverse step of section 5.3. Once the request stream arrives at the 
server 120, it is decoded back to a data block request list. Each data structure of type 
(1.3) representing a group of requested data blocks is converted to the sub-list of these 
data blocks. 

6.3. Step 803: Encoding low resolution part of ROI 

Step 803 is described in Figure 11. It is only performed whenever the data 
blocks associated with a low resolution subband tile are not available in the server 
cache 121. 

Step 1101 is the inverse step of step 906 described in section 6.1.8. In the 
preprocessing algorithm, subband tiles of lower resolution - that is resolutions lower 
than numberOfResolutions - jumpSize - are stored in the cache using a variable length 
type algorithm. For such a tile, the variable length representation is first decoded. The 
algorithm uses the stored value maxBitPlane(tile) . If maxBitPlane(tile) = 0, then all 



are 



[t _x y t _yJ ^resolution), then the coefficients are copied into strip 



t resolution - 1 at the "virtual" location: 
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the coefficients are set to zero. Else the following simple decoding algorithm is 
performed: 

1 . For each group of four coefficients [coef(2 x / + x, 2 x j + y)} s ymQA , 

5 log, [maxBitPlane (tile)) bits are read representing the variable length of 

the group. 

2. Assume the variable length is length(ij) . For each of the four 
coefficients, length (i, j) + 1 bits are read. The least significant bit 

10 represents the sign. The reconstructed coefficient takes the value: 

f -1 readBits&l = \ 
threshold,(readBits»l)^ ] re ^ &1 = 0 

In step 1 102 the encoding algorithm is used, described in section 4.1 to encode 
1 5 the requested data blocks ( 1 .2) associated with the extracted quantized subband tile. 
Observe that in lossy mode the encoding algorithm is using quantized coefficients read 
from the cache instead of the original coefficients. Nevertheless, this has no effect on 
the encoding algorithm since the quantization procedure 905 removed from each 
coefficient precision data that is not encoded. 

20 

6.4. Step 804: Processing high resolution part of ROI 

Step 804 is described in Figure 12. In case we have used jumpSize > 0 in step 
801 and the resolution of the ROI > numberOJResolutions - jumpSize , we are 
25 sometimes required to perform a local variation of the preprocessing step, described in 
section 6.1 . Whenever the server 120 receives a request list of data, the following is 
checked. If a data block has been previously computed (present in the cache 121) or is 
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associated with a low resolution subband tile data block, then it is either simply read 
from the cache or handled in step 803. Else, the coordinates of the data block are used 
in the same manner as step 704 (see section 5.4) to find the minimal portion of the ROI 
that needs to be processed. Then, a local version of the preprocessing algorithm is 
5 performed for this local portion. The difference here is that step 1026 replaces Variable 
Length coding steps 905-906 of the preprocessing algorithm by the encoding algorithm 
provided in section 4. 1 . 

6.5. Step 805: Progressive transmission of ROI 

10 

In the final step, the encoded data tiles are sent from the server 120 to the client 
1 10, in the order they were requested. In many cases, data blocks will be empty. For 
example, for a region of the original image with a constant pixel value, all of the 
corresponding data blocks will be empty, except for the first one which will contain 

15 only one byte with the value zero representing maxBitPlane{tile) = 0 . For a low 

activity region of the image, only the last data blocks representing higher accuracy will 
contain any data. Therefore, to avoid the extra side information, rectangles of empty 
data blocks are collected and reported to the client 1 10 using a structure of type (1.3) 
under the restriction that they are reported in the order in which they were requested. 

20 For blocks containing actual data, only the data block's size in bytes need be reported, 
since the client 1 10 already knows which data blocks to expect and in which order. 

The present invention has been described in only a few embodiments, and with 
respect to only a few applications (e.g., commercial printing and medical imaging). 
25 Those of ordinary skill in the art will recognize that the teachings of the present 
invention may be used in a variety of other applications where images are to be 
transmitted over a communication media. 
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Claims: 

A system for transmitting a digital image over a communication network, 
comprising: 

(a) an image storage device for storing an original digital image at an 
original resolution; 

(b) a client computer coupled to the communication network, wherein the 
client computer generates at least one request for interaction with the 
original digital image and at least one subsequent request, each 
identifying a region of interest at a selected resolution within the original 
digital image; 

(c) a server computer coupled to the communication network and the 
storage device, the server computer performing the steps of: 

(i) pre-processing the original digital image in response to the 
receipt of the first request, the pre-processed digital image 
having a resolution lower than or equal to the original resolution; 

(ii) for each subsequent request received from the client computer: 

(A) compressing the region of interest associated with each 
request, based upon: 

(1) the pre-processed digital image, and 

(2) further processing of portions of the original 
digital image at the selected resolution of the 
region of interest identified in the subsequent 
request; and 

(B) progressively transmitting the compressed region of 
interest to the client computer via the communication 
network. 

The system of claim 1, wherein the client computer progressively performs the 
steps of: 
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(i) decompressing the compressed region of interest transmitted 
from the server computer; and 

(ii) rendering the decompressed region of interest. 

3. The system of claim 2, wherein the client computer renders the decompressed 
region of interest onto a video display. 

4. The system of claim 2, wherein the client computer renders the decompressed 
region of interest onto a printer. 

5. The system of claim 2, wherein the client computer performs the initial step of 
storing the compressed region of interest in a cache associated with the client 
computer. 

6. The system of claim 5, wherein the client computer only requests that portion of 
a region of interest that was not previously stored in the cache. 

7. The system of claim 1 , wherein the region of interest is transmitted from the 
server computer to the client computer using encoded data blocks having X, Y, 
resolution and accuracy dimensions. 

8. The system of claim 1, wherein the client computer may request the digital 
image from the server computer in a progressive mode selected from the list of: 
resolution, accuracy and spatial. 

9. The system of claim 1, wherein the pre-processing step performed by the server 
comprises the sub-steps of; 

(A) generating a lower resolution digital image from the 

original digital image, reading the original digital image 
from the image storage device; 
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(B) color transforming the lower resolution digital image to a 

compression color space; 
.(C) transforming the compression color space digital image 

into wavelet coefficients; 

(D) performing quantization on the wavelet coefficients; and 

(E) storing the quantified wavelet coefficients in a storage 
device associated with the server computer. 

10. The system of claim 9, wherein the transforming sub-step converts the digital 
image from its original color space to a compression color space, and the client 
computer converts the digital image from the compression color space to a 
rendering color space. 

11. The system of claim 1 , wherein the pre-processing step utilizes constrained 
memory with an order of magnitude of the square root of the original digital 
image size. 

12. The system of claim 1, wherein the compressing step utilizes the non-zero-trees 
type of progressive encoding technique. 

13. The system of claim 1 , wherein in the compressing step, the server computer 
fully compresses the region of interest. 

14. The system of claim 1 , wherein in the compressing step, the server partially 
compresses the region of interest. 

15. The system of claim 1. wherein the progressive transmitting step comprises the 
sub-steps of: 
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( 1 ) transmitting the compressed region of interest 
within the selected digital image at a relatively 
low resolution; and 

(2) transmitting the compressed region of interest 
within the selected digital image at successively 
higher resolutions. 

16. The system of claim 1, wherein the progressive transmitting step comprises the 
sub-steps of: 

( 1 ) transmitting the compressed region of interest within the 
selected digital image at a relatively low level of 
accuracy; and 

(2) transmitting the compressed region of interest within the 
selected digital image at successively higher levels of 
accuracy. 

17. The system of claim 1, wherein the progressive transmitting step comprises the 
sub-steps of: 

(1) transmitting a first band of the compressed region of 
interest within the selected digital image in spatial order 
starting from the top; and 

(2) repeating the transmission of the rest of the bands of the 
compressed region of interest within the selected digital 
image in spatial order. 

18. A system for transmitting a digital image over a communication network, 
comprising: 

(a) an image storage device for storing an original digital image; 



NSDOCID: <WO 01 16764A1 J_> 



WO 01/16764 



PCT/US00/23479 



66 



a client computer coupled to the communication network, wherein the 
client computer generates at least one request, each request identifying a 
region of interest within the original digital image; 
a server computer coupled to the communication network and the 
storage device, the server computer performing the steps of: 

(i) receiving a first request from the client computer; 

(ii) pre-processing the original digital image in response to the 
receipt of the first request; 

(iii) for each subsequent request received from the client computer: 

(A) compressing the region of interest associated with each 
subsequent request, based upon the pre-processed digital 
image; and 

(B) progressively transmitting the compressed region of 
interest to the client computer via the communication 
network. 



of claim 1 8, wherein the client computer progressively performs the 

decompressing the compressed region of interest transmitted 
from the server computer; and 
rendering the decompressed region of interest. 

20. The system of claim 19, wherein the client computer renders the decompressed 
region of interest onto a video display. 

21. The system of claim 1 9, wherein the client computer renders the decompressed 
region of interest onto a printer. 



19. The system 
steps of: 
(i) 

(ii) 
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22. The system of claim 19, wherein the client computer performs the initial step of 
storing the compressed region of interest in a cache associated with the client 
computer. 

23. The system of claim 22, wherein the client computer only requests that portion 
of a region of interest that was not previously stored in the cache. 

24. The system of claim 1 8, wherein the region of interest is transmitted from the 
server computer to the client computer using encoded data blocks having X, Y, 
resolution and accuracy dimensions. 

25. The system of claim 18, wherein the client computer may request the digital 
image from the server computer in a progressive mode selected from the list of: 
resolution, accuracy and spatial. 

26. The system of claim 18, wherein the pre-processing step performed by the 
server comprises the sub-steps of: 

(A) reading the original digital image from the image storage 
device; 

(B) color transforming the lower resolution digital image to a 
compression color space; 

(C) transforming the compression color space digital image 
into wavelet coefficients; 

(D) performing quantization on the wavelet coefficients; and 

(E) storing the quantified wavelet coefficients in a storage 
device associated with the server computer. 

27. The system of claim 26, wherein the transforming sub-step converts the digital 
image from its original color space to a compression color space, and the client 
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computer converts the digital image from the compression color space to a 
rendering color space. 

28. The system of claim 1 8, wherein the pre-processing step utilizes constrained 
memory with an order of magnitude of the square root of the original digital 
image size. 

29. The system of claim 1 8, wherein the compressing step utilizes the non-zero- 
trees type of progressive encoding technique. 

30. The system of claim 1 8, wherein in the compressing step, the server computer 
fully compresses the region of interest. 

31. The system of claim 1 8, wherein in the compressing step, the server partially 
compresses the region of interest. 

32. The system of claim 1 8, wherein the progressive transmitting step comprises the 
sub-steps of: 

(1 ) transmitting the compressed region of interest within the 
selected digital image at a relatively low resolution; and 

(2) transmitting the compressed region of interest within the 
selected digital image at successively higher resolutions. 

33. The system of claim 1 8, wherein the progressive transmitting step comprises the 
sub-steps of: 

(1) transmitting the compressed region of interest within the 
selected digital image at a relatively low level of 
accuracy; and 
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(2) transmitting the compressed region of interest within the 
selected digital image at successively higher levels of 
accuracy. 

34. The system of claim 1 8, wherein the progressive transmitting step comprises the 
sub-steps of: 

(1) transmitting a first band of the compressed region of 
interest within the selected digital image in spatial order 
starting from the top; and 

(2) repeating the transmission of the rest of the bands of the 
compressed region of interest within the selected digital 
image in spatial order. 

35. The system of claim 18, wherein the pre-processing step performed by the 
server comprises the sub-steps of: 

(A) reading the original digital image from the image storage 
device; 

(B) color transforming the original digital image to a 
compression color space; 

(C) transforming the compression color space digital image 
into wavelet coefficients; 

(D) performing quantization on the wavelet coefficients; and 

(E) storing the quantified wavelet coefficients in a storage 
device associated with the server computer. 

36. A system for transmitting a digital image over a communication network, 
comprising: 

(a) an image storage device for storing an original digital image; 
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(b) a client computer coupled to the communication network, wherein the 
client computer generates at least one request, each request identifying a 
region of interest within the original digital image; 

(c) a server computer coupled to the communication network and the 
storage device, the server computer performing the steps of: 

(i) receiving a first request from the client computer; 

(ii) pre-processing the original digital image in response to the 
receipt of the first request; 

(iii) for each subsequent request received from the client computer: 

(A) compressing the region of interest associated with each 
subsequent request, based upon the pre-processed digital 
image; and 

(B) transmitting the compressed region of interest to the 
client computer via the communication network. 

37. A server computer for transmitting a digital image to a client computer over a 

communication network, the server computer being coupled to an image storage 
device for storing t an original digital image at an original resolution, wherein the 
client computer generates at least one request for interaction with the original 
digital image and at least one subsequent request, each identifying a region of 
interest at a selected resolution within the original digital image, the server 
computer comprising: 

(a) means for pre-processing the original digital image in response to the 
receipt of the first request, the pre-processed digital image having a 
resolution lower than or equal to the original resolution; 

(b) means, for each subsequent request received from the client computer, 
for: 

(i) compressing the region of interest associated with each request, 
based upon: 

( 1 ) the pre-processed digital image, and 
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(2) further processing of portions of the original digital 

image at the selected resolution of the region of interest 
identified in the subsequent request; and 
(ii) progressively transmitting the compressed region of interest to 

the client computer via the communication network. 

38. A server computer for transmitting a digital image to a client computer over a 
communication network, the server computer being coupled to an image storage 
device for storing an original digital image, wherein the client computer 
generates at least one request, each request identifying a region of interest 
within the original digital image, the server computer comprising: 

(a) means for receiving a first request from the client computer, 

(b) means for pre-processing the original digital image in response to the 
receipt of the first request; 

(c) means, for each subsequent request received from the client computer, 
for: 

(i) compressing the region of interest associated with each 
subsequent request, based upon the pre-processed digital image; 
and 

(ii) progressively transmitting the compressed region of interest to 
the client computer via the communication network. 

39. A method for transmitting a digital image from an image storage device to a 
client computer over a communication network, comprising the steps of: 

(a) storing an original digital image at an original resolution in the image 
storage device; 

(b) generating at the client computer at least one request for interaction with 
the original digital image and at least one subsequent request, each 
request identifying a region of interest at a selected resolution within the 
original digital image; 
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(c) pre-processing the original digital image in response to the receipt of the 
first request, the pre-processed digital image having a resolution lower 
than or equal to the original resolution; 

(d) for each subsequent request received from the client computer: 

(i) compressing the region of interest associated with each request, 
based upon: 

(1) the pre-processed digital image, and 

(2) further processing of portions of the original digital 
image at the selected resolution of the region of interest 
identified in the subsequent request; and 

(ii) progressively transmitting the compressed region of interest to 
the client computer via the communication network. 

40. A method for transmitting a digital image to a client computer over a 
communication network, comprising the steps of: 

(a) storing an original digital image; 

(b) generating at the client computer at least one request, each request 
identifying a region of interest within the original digital image; 

(c) receiving a first request from the client computer; 

(d) pre-processing the original digital image in response to the receipt of the 
first request; 

(e) for each subsequent request received from the client computer: 

(i) compressing the region of interest associated with each 
subsequent request, based upon the pre-processed digital image; 
and 

(ii) progressively transmitting the compressed region of interest to 
the client computer via the communication network. 

41 . A computer-readable media for a server computer, programmed to cause the 
server computer to transmit a digital image to a client computer over a 



BNSDOCIO: <WO 0116764A1_I_> 



WO 01/16764 



PCT/US00/23479 



73 

communication network, the server computer being coupled to an image storage 
device for storing an original digital image at an original resolution, wherein the 
client computer generates at least one request for interaction with the original 
digital image and at least one subsequent request, each identifying a region of 
interest at a selected resolution within the original digital image, the computer- 
readable media programmed to performed the steps of: 

(a) pre-processing the original digital image in response to the receipt of the 
first request, the pre-processed digital image having a resolution lower 
than or equal to the original resolution; 

(b) for each subsequent request received from the client computer: 

(i) compressing the region of interest associated with each request, 
based upon: 

(1) the pre-processed digital image, and 

(2) further processing of portions of the original digital 
image at the selected resolution of the region of interest 
identified in the subsequent request; and 

(ii) progressively transmitting the compressed region of interest to 
the client computer via the communication network. 

42. A computer-readable media for a server computer, programmed to cause the 
server computer to transmit a digital image to a client computer over a 
communication network, the server computer being coupled to an image storage 
device for storing an original digital image, wherein the client computer 
generates at least one request, eachrequest identifying a region of interest 
within the original digital image, the computer-readable media programmed to 
perform the steps of:: 

(a) receiving a first request from the client computer; 

(b) pre-processing the original digital image in response to the receipt of the 
first request; 

(c) for each subsequent request received from the client computer: 
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compressing the region of interest associated with each 
subsequent request, based upon the pre-processed digital image; 
and 

progressively transmitting the compressed region of interest to 
the client computer via the communication network. 
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zeroModel_1 6.start_model(); 
zeroModeL4.start_model(); 
zeroCoefModel.start_modelQ; 
coefSignModel.start_model(j; 

while (encoder.getNextGroup0f16()) j 
bool isZero; 

if (encoder.isGroupType16()) [ 
isZero = encoder.isZeroGroup0f16(); 
arithmetic_encode_symbol(ZeroModel_ 1 6,isZero); 

if (isZero) 
continue; 

t 

while (encoder.getNextGroup0f4()) j 

if (encoder.isGroupType4()) j 

if (1encoder.mustbeNoZeroGroup()) { 

isZero = encoder.isZeroGroup0f4(); 
a rithmetic_encode_symbol (Ze roModel_4 , isZero) ; 
if (isZero) 
continue; 

while (encoder.getNext_Type1_Coef (isZero)) \ 

if (1encoder.mustbeNoZeroCoef()) 
arithmetic_encode_symbol(zeroCoefModel,isZero); 

if (1 isZero) 

arithmetic_encode_symbol(coefSignModel,encoder:getCoefSign()); 

if (l(encoder.isLastBitPlane() equalBinSetting)) 
bitModel.start_model(); 

int bit; 

while (encoder.getNextSubordinatePassBit (bit)) 
^ arithmetic_encode_symbol(bitModel,bit); 
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zeroModeU 6.start_model(); 
zeroModel_4.start_model(); 
zeroCoefModel.start_modelQ; 
coefSignModel.start_model(); 

while (decoder.getNextGroup0f16()) | 

if (decoder.isGroupType16()) j 
if (arithmetic_decode_symbol(zeroModel_16)) j 
decoder.zeroGroupOf 1 6(); 
continue; 

! 

else 

decoder.removeZeroGroupOf 1 6(); 

while (decoder.getNextGroup0f4Q) { 
if (decoder.isGroupType4(5) j 
if (1decoder.mustbeNotZeroGroup()) j 
if (orithmetic_decode_symbol(zeroModeL4)) j 
decoder.zeroGroup0f4(); 
continue; 

7 

decoder.removeZeroGroup0f4(); 

while (decoder.getNext_Type1_Coef()) j 
if (decoder.mustbeNotZeroCoefQ) 

decoder.setNextSigCoef(arithmetic_decode_symbolfcoefSignModel)); 
else if (1arthmetic_decode_symbol(zeroCoef Model)) 

decoder.setNextSigCoef(arthmetic_decode_symbol (bitModel)); 

! 

! 

if (1(decoder.isLastBitPlane() && equalBinSetting)) { 
bitModel.start_model(); 
while(decoder.moreSignificantCoef()) 
decoder.setSignificantCoefBit(arithmetic_decode_symbol(bitModel)); 
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for (inty=0 ; j<2 num ^ er ^^ eso ^ ,on= J um P^ ,ze • j f 

for (int t_Resolution=numberOfResolutions-jumpSize, t_y=j ; 
LyzO && LResolution> 1 ; 
LResolution—, l_y=L.y/2-1) j 

if (Ly^nTileY(LResoluthn)) / 

for (int Lx=0 ; f_*< nTileX(LResolution) ; t_x++) 
preprocessSubbancfnie( L.x,Ly,LReso/ution) ; 

i 

if (t_y(mod2)==1) 
break; 

i 

i 
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RecursiveRenderingMorch (int Lresoluiion, int Ly) \ 

if (Ly< highesLy(Lresolution)) 

getCoefficientsOfTileRow (t_resolution,t_y); 

if (Ly > hwesLy(Lresolution)) 

InverseSubbandTransf ormTileRow ( Lresolution, Ly); 

if (Ly > lowesLy(Lresolution) 8ck Lresolution t= dyadicResolution(ROI)) f 
int y = jY - 1; 

if (2xLyz/owesLy(Lreso/ution+f)&8c 
2xLy < highesLy(Lresolution+ 1)) 
RecursiveRenderingMorch (Lresolution+1, 2xLy); 

if (2x Ly+ 1 z lowesLy(Lresoluiion+ 1) kk "~ 
2x Ly+ 1 < highesLy(Lresolution+ 1)) 
RecursiveRenderingMorch (Lresolution+1, 2xLy+1); 

if (2xLy+2< highesLy(Lresolution+l)) 

RecursiveRenderingMorch (Lresolution+1, 2xLy+2); 



! 



i 
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